Compositions and methods for enhancing disease resistance in plants

ABSTRACT

Compositions and methods for enhancing or creating plant disease resistance to plant pests are provided. Transforming a plant with a novel maize, sorghum, or rice disease resistance gene homologue (RGH) of the invention enhances disease resistance of the plant. Transformed plants, plant cells, tissues, and seed having enhanced disease resistance are also provided.

CROSS-REFERENCE TO RELATED APPLICATIONS

[0001] This application is a continuation of U.S. application Ser. No. 09/354,453, filed Jul. 15, 1999 and also claims the benefit of U.S. Provisional Application Serial No. 60/093,290, filed Jul. 17, 1998, each of which are hereby incorporated in their entirety by reference herein.

FIELD OF THE INVENTION

[0002] The invention relates to the genetic manipulation of plants, particularly to transforming plants with genes that enhance disease resistance.

BACKGROUND OF THE INVENTION

[0003] Disease in plants is caused by biotic and abiotic causes. Biotic causes include fungi, viruses, bacteria, and nematodes. Of these, fungi are the most frequent causative agent of disease in plants. Abiotic causes of disease in plants include extremes of temperature, water, oxygen, soil pH, plus nutrient-element deficiencies and imbalances, excess heavy metals, and air pollution.

[0004] A host of cellular processes enables plants to defend themselves from disease caused by pathogenic agents. These processes apparently form an integrated set of resistance mechanisms that is activated by initial infection and then limits further spread of the invading pathogenic microorganism. Subsequent to recognition of a potentially pathogenic microbe, plants can activate an array of biochemical responses. Generally, the plant responds by inducing several local responses in the cells immediately surrounding the infection site. The most common resistance response observed in both nonhost and race-specific interactions is termed the “hypersensitive response” (HR). In the hypersensitive response, cells contacted by the pathogen, and often neighboring cells, rapidly collapse and dry in a necrotic fleck. Other responses include the deposition of callose, the physical thickening of cell walls by lignification, and the synthesis of various antibiotic small molecules and proteins. Genetic factors in both the host and the pathogen determine the specificity of these local responses, which can be very effective in limiting the spread of infection.

[0005] The hypersensitive response in many plant-pathogen interactions results from the expression of a resistance (R) gene in the plant and a corresponding avirulence (avr) gene in the pathogen. This interaction is associated with the rapid, localized cell death of the hypersensitive response. R genes that respond to specific bacterial, fungal, or viral pathogens, have been isolated from a variety of plant species and several appear to encode cytoplasmic proteins.

[0006] The resistance gene in the plant and the avirulence gene in the pathogen often conform to a gene-for-gene relationship. That is, resistance to a pathogen is only observed when the pathogen carries a specific avirulence gene and the plant carries a corresponding or complementing resistance gene. Because avrR gene-for-gene relationships are observed in many plant-pathogen systems and are accompanied by a characteristic set of defense responses, a common molecular mechanism underlying avrR gene mediated resistance has been postulated. A simple model which has been proposed is that pathogen avr genes directly or indirectly generate a specific molecular signal (ligand) that is recognized by cognate receptors encoded by plant R genes. Recently, direct evidence for a physical interaction between paired R and avr gene products as receptors and ligands in plant-pathogen recognition is provided by an interaction between tomato Pto and Pseudomonas syringe avrPto products (Scofield et al. (1996) Science 274:2063-2065; Tang et al. (1996) Science 274:2060-2063.

[0007] Both plant resistance genes and corresponding pathogen avirulence genes have been cloned. The plant kingdom contains thousands of R genes with specific specificities for viral, bacterial, fungal, or nematode pathogens. Although there are differences in the defense responses induced during different plant-pathogen interactions, some common themes are apparent among R gene-mediated defenses. The function of a given R gene is dependent on the genotype of the pathogen. Plant pathogens produce a diversity of potential signals, and in a fashion analogous to the production of antigens by mammalian pathogens, some of these signals are detectable by some plants.

[0008] The avirulence gene causes the pathogen to produce a signal that triggers a strong defense response in a plant with the appropriate R gene. However, expressing an avirulence gene does not stop the pathogen from being virulent on hosts that lack the corresponding R gene. A single plant can have many R genes, and a pathogen can have many avr genes.

[0009] As noted, among the causative agents of infectious disease of crop plants, the phytopathogenic fungi play the dominant role. Phytopathogenic fungi cause devastating epidemics, as well as causing significant annual crop yield losses. All of the approximately 300,000 species of flowering plants are attacked by pathogenic fungi. However, a single plant species can be host to only a few fungal species, and similarly, most fungi usually have a limited host range.

[0010] Plant disease outbreaks have resulted in catastrophic crop failures that have triggered famines and caused major social change. Generally, the best strategy for plant disease control is to use resistant cultivars selected or developed by plant breeders for this purpose. However, the potential for serious crop disease epidemics persists today, as evidenced by outbreaks of the Victoria blight of oats and southern corn leaf blight. Accordingly, molecular methods are needed to supplement traditional breeding methods to protect plants from pathogen attack.

SUMMARY OF THE INVENTION

[0011] Compositions and methods for creating or enhancing resistance to plant pests are provided. Compositions are nucleotide sequences for novel disease resistance gene homologues cloned from maize, rice, and sorghum and the amino acid sequences for the proteins or partial-length proteins or polypeptides encoded thereby. Methods of the invention involve stably transforming a plant with one of these novel disease resistance gene homologues operably linked with a promoter capable of driving expression of a nucleotide coding sequence in a plant cell. Expression of the novel nucleotide sequences confers disease resistance to a plant by interacting with the complementing phytopathogen avirulence gene product released into the plant by the invading plant pathogen. The methods of the invention find use in controlling plant pests, including fungal pathogens, viruses, nematodes, insects, and the like.

[0012] Transformed plants and seeds, as well as methods for making such plants and seeds are additionally provided.

BRIEF DESCRIPTION OF THE DRAWINGS

[0013] FIGS. 1A-1B shows an alignment of conserved regions of the deduced amino acid sequences encoded by the maize, rice, and sorghum resistance gene homologues (RGHs) of the invention with several other R genes. The alignment starts with the third amino acid residue within the kinase-2 domain, a sequence feature shared by disease resistance proteins encoded by R genes in the NBS-LRR superfamily. Three of the novel sequences shown in the alignment are from maize (M05, also referred to as M5-1, SEQ ID NO: 37; M06, also referred to as M6-1, SEQ ID NO: 38; Mr05, also referred to as M5-6, SEQ ID NO: 39), four are from rice (R0501, also referred to as R5-1, SEQ ID NO: 40; R0502, also referred to as R5-2, SEQ ID NO: 41; R0503, also referred to as R5-3, SEQ ID NO: 42; R0518, also referred to as R5-4, SEQ ID NO: 43), and six are from sorghum (S0510, also referred to as S5-5, SEQ ID NO: 44; S0535, also referred to as S5-2A, SEQ ID NO: 45; S0545, also referred to as S5-2B, SEQ ID NO: 46; S0606, also referred to as S6-1, SEQ ID NO: 47; S0608, also referred to as S6-2, SEQ ID NO: 48; and S11-1, SEQ ID NO: 49). These sequences are aligned with the corresponding conserved regions of flax L6 (SEQ ID NO: 50), tobacco N (SEQ ID NO: 51), tomato Prf (SEQ ID NO: 52), and Arabidopsis RPS2 (SEQ ID NO: 53) and RPM1 (SEQ ID NO: 54). The alignment was generated using PRETTYBOX function of GCG sequence analysis packages. SEQ ID NOs shown in parentheses set forth that portion of a particular RGH polypeptide of the invention that is shown in this figure.

[0014]FIG. 2 shows an alignment of kinase-2 domains of the novel RGHs M6-1, S6-1, S6-2, and S11-1 with the kinase-2 domains of tomato Prf, and Arabidopsis RPS2 and RPM1. Note that the putative introns have been removed from the deduced amino acid sequences of the novel RGHs and are shown as asterisks.

[0015]FIG. 3 shows sequence features of the S6-1 gene (SEQ ID NO: 34) subcloned from a sorghum BAC clone. Black and hatched boxes represent coding regions and open boxes represent the putative intron located in the kinase-2 domain. The nucleotide numbers are shown above boxes and in italic, and numbers of the deduced amino acids are shown below boxes. LZ, leucine zipper; P, P-loop; K2, kinase-2; K3a, kinase-3a; TM, a putative transmembrane domain; X, conserved domain X; Y, conserved domain Y; LRRs, leucine-rich repeats. The similarity of each domain in S6-1 to the corresponding region of Arabidopsis RPM1 is indicated below individual domains. The corresponding PCR RGH clone of S6-1 (PCR S6-1; SEQ ID NO: 21) is also shown.

[0016]FIG. 4 schematically shows a plasmid vector comprising a RGH sequence of the invention operably linked to the ubiquitin promoter.

DETAILED DESCRIPTION OF THE INVENTION

[0017] The invention is drawn to compositions and methods for creating or enhancing resistance in a plant to plant pests. Accordingly, the compositions and methods are also useful in protecting plants against fungal pathogens, viruses, nematodes, insects, and the like.

[0018] By “disease resistance” is intended that the plants avoid the disease symptoms that are the outcome of plant-pathogen interactions. That is, pathogens are prevented from causing plant diseases and the associated disease symptoms. The compositions and methods of the invention can be utilized to protect plants from disease, particularly those diseases that are caused by plant pathogens.

[0019] Pathogens of the invention include, but are not limited to, viruses or viroids, bacteria, insects, nematodes, fungi, and the like. Viruses include any plant virus, for example, tobacco or cucumber mosaic virus, ringspot virus, necrosis virus, maize dwarf mosaic virus, etc. Specific fungal and viral pathogens for the major crops include:

[0020] Soybeans: Phytophthora megasperma fsp. glycinea, Macrophomina phaseolina, Rhizoctonia solani, Sclerotinia sclerotiorum, Fusarium oxysporum, Diaporthe phaseolorum var. sojae (Phomopsis sojae), Diaporthe phaseolorum var. caulivora, Sclerotium rolfsii, Cercospora kikuchii, Cercospora sojina, Peronospora manshurica, Colletotrichum dematium (Colletotichum truncatum), Corynespora cassiicola, Septoria glycines, Phyllosticta sojicola, Alternaria alternata, Pseudomonas syringae p.v. glycinea, Xanthomonas campestris p.v. phaseoli, Microsphaera diffusa, Fusarium semitectum, Phialophora gregata, Soybean mosaic virus, Glomerella glycines, Tobacco Ring spot virus, Tobacco Streak virus, Phakopsora pachyrhizi, Pythium aphanidermatum, Pythium ultimum, Pythium debaryanum, Tomato spotted wilt virus, Heterodera glycines Fusarium solani; Canola: Albugo candida, Alternaria brassicae, Leptosphaeria maculans, Rhizoctonia solani, Sclerotinia sclerotiorum, Mycosphaerella brassiccola, Pythium ultimum, Peronospora parasitica, Fusarium roseum, Alternaria alternata; Alfalfa: Clavibater michiganese subsp. insidiosum, Pythium ultimum, Pythium irregulare, Pythium splendens, Pythium debaryanum, Pythium aphanidermatum, Phytophthora megasperma, Peronospora trifoliorum, Phoma medicaginis var. medicaginis, Cercospora medicaginis, Pseudopeziza medicaginis, Leptotrochila medicaginis, Fusarium, Xanthomonas campestris p.v. alfalfae, Aphanomyces euteiches, Stemphylium herbarum, Stemphylium alfalfae; Wheat: Pseudomonas syringae p.v. atrofaciens, Urocystis agropyri, Xanthomonas campestris p.v. translucens, Pseudomonas syringae p.v. syringae, Alternaria alternata, Cladosporium herbarum, Fusarium graminearum, Fusarium avenaceum, Fusarium culmorum, Ustilago tritici, Ascochyta tritici, Cephalosporium gramineum, Collotetrichum graminicola, Erysiphe graminis f.sp. tritici, Puccinia graminis f.sp. tritici, Puccinia recondita f.sp. tritici, Puccinia striiformis, Pyrenophora tritici-repentis, Septoria nodorum, Septoria tritici, Septoria avenae, Pseudocercosporella herpotrichoides, Rhizoctonia solani, Rhizoctonia cerealis, Gaeumannomyces graminis var. tritici, Pythium aphanidermatum, Pythium arrhenomanes, Pythium ultimum, Bipolaris sorokiniana, Barley Yellow Dwarf Virus, Brome Mosaic Virus, Soil Borne Wheat Mosaic Virus, Wheat Streak Mosaic Virus, Wheat Spindle Streak Virus, American Wheat Striate Virus, Claviceps purpurea, Tilletia tritici, Tilletia laevis, Ustilago tritici, Tilletia indica, Rhizoctonia solani, Pythium arrhenomannes, Pythium gramicola, Pythium aphanidermatum, High Plains Virus, European wheat striate virus; Sunflower: Plasmophora halstedii, Sclerotinia sclerotiorum, Aster Yellows, Septoria helianthi, Phomopsis helianthi, Alternaria helianthi, Alternaria zinniae, Botrytis cinerea, Phoma macdonaldii, Macrophomina phaseolina, Erysiphe cichoracearum, Rhizopus oryzae, Rhizopus arrhizus, Rhizopus stolonifer, Puccinia helianthi, Verticillium dahliae, Erwinia carotovorum pv. carotovora, Cephalosporium acremonium, Phytophthora cryptogea, Albugo tragopogonis; Corn: Fusarium moniliforme var. subglutinans, Erwinia stewartii, Fusarium moniliforme, Gibberella zeae (Fusarium graminearum), Stenocarpella maydi (Diplodia maydis), Pythium irregulare, Pythium debaryanum, Pythium graminicola, Pythium splendens, Pythium ultimum, Pythium aphanidermatum, Aspergillusflavus, Bipolaris maydis O, T (Cochliobolus heterostrophus), Helminthosporium carbonum I, II & III (Cochliobolus carbonum), Exserohilum turcicum I, II & III, Helminthosporium pedicellatum, Physoderma maydis, Phyllosticta maydis, Kabatiella-maydis, Cercospora sorghi, Ustilago maydis, Puccinia sorghi, Puccinia polysora, Macrophomina phaseolina, Penicillium oxalicum, Nigrospora oryzae, Cladosporium herbarum, Curvularia lunata, Curvularia inaequalis, Curvularia pallescens, Clavibacter michiganense subsp. nebraskense, Trichoderma viride, Maize Dwarf Mosaic Virus A & B, Wheat Streak Mosaic Virus, Maize Chlorotic Dwarf Virus, Claviceps sorghi, Pseudonomas avenae, Erwinia chrysanthemi pv. zea, Erwinia carotovora, Corn stunt spiroplasma, Diplodia macrospora, Sclerophthora macrospora, Peronosclerospora sorghi, Peronosclerospora philippinensis, Peronosclerospora maydis, Peronosclerospora sacchari, Sphacelotheca reiliana, Physopella zeae, Cephalosporium maydis, Cephalosporium acremonium, Maize Chlorotic Mottle Virus, High Plains Virus, Maize Mosaic Virus, Maize Rayado Fino Virus, Maize Streak Virus, Maize Stripe Virus, Maize Rough Dwarf Virus; Sorghum: Exserohilum turcicum, Colletotrichum graminicola (Glomerella graminicola), Cercospora sorghi, Gloeocercospora sorghi, Ascochyta sorghina, Pseudomonas syringae p.v. syringae, Xanthomonas campestris p.v. holcicola, Pseudomonas andropogonis, Puccinia purpurea, Macrophomina phaseolina, Perconia circinata, Fusarium moniliforme, Alternaria alternata, Bipolaris sorghicola, Helminthosporium sorghicola, Curvularia lunata, Phoma insidiosa, Pseudomonas avenae (Pseudomonas alboprecipitans), Ramulispora sorghi, Ramulispora sorghicola, Phyllachara sacchari, Sporisorium reilianum (Sphacelotheca reiliana), Sphacelotheca cruenta, Sporisorium sorghi, Sugarcane mosaic H, Maize Dwarf Mosaic Virus A & B, Claviceps sorghi, Rhizoctonia solani, Acremonium strictum, Sclerophthona macrospora, Peronosclerospora sorghi, Peronosclerospora philippinensis, Sclerospora graminicola, Fusarium graminearum, Fusarium oxysporum, Pythium arrhenomanes, Pythium graminicola, etc.

[0021] Nematodes include parasitic nematodes such as root-knot, cyst, lesion, and renniform nematodes, etc.

[0022] Insect pests include insects selected from the orders Coleoptera, Diptera, Hymenoptera, Lepidoptera, Mallophaga, Homoptera, Hemiptera, Orthoptera, Thysanoptera, Dermaptera, Isoptera, Anoplura, Siphonaptera, Trichoptera, etc., particularly Coleoptera and Lepidoptera. Insect pests of the invention for the major crops include: Maize: Ostrinia nubilalis, European corn borer; Agrotis ipsilon, black cutworm; Helicoverpa zea, corn earworm; Spodoptera frugiperda, fall armyworm; Diatraea grandiosella, southwestern corn borer; Elasmopalpus lignosellus, lesser cornstalk borer; Diatraea saccharalis, surgarcane borer; Diabrotica virgifera, western corn rootworm; Diabrotica longicornis barberi, northern corn rootworm; Diabrotica undecimpunctata howardi, southern corn rootworm; Melanotus spp., wireworms; Cyclocephala borealis, northern masked chafer (white grub); Cyclocephala immaculata, southern masked chafer (white grub); Popillia japonica, Japanese beetle; Chaetocnema pulicaria, corn flea beetle; Sphenophorus maidis, maize billbug; Rhopalosiphum maidis, corn leaf aphid; Anuraphis maidiradicis, corn root aphid; Blissus leucopterus leucopterus, chinch bug; Melanoplusfemurrubrum, redlegged grasshopper; Melanoplus sanguinipes, migratory grasshopper; Hylemya platura, seedcorn maggot; Agromyza parvicornis, corn blot leafminer; Anaphothrips obscrurus, grass thrips; Solenopsis milesta, thief ant; Tetranychus urticae, twospotted spider mite; Sorghum: Chilo partellus, sorghum borer; Spodopterafrugiperda, fall armyworm; Helicoverpa zea, corn earworm; Elasmopalpus lignosellus, lesser cornstalk borer; Feltia subterranea, granulate cutworm; Phyllophaga crinita, white grub; Eleodes, Conoderus, and Aeolus spp., wireworms; Oulema melanopus, cereal leaf beetle; Chaetocnema pulicaria, corn flea beetle; Sphenophorus maidis, maize billbug; Rhopalosiphum maidis; corn leaf aphid; Sipha flava, yellow sugarcane aphid; Blissus leucopterus leucopterus, chinch bug; Contarinia sorghicola, sorghum midge; Tetranychus cinnabarinus, carmine spider mite; Tetranychus urticae, twospotted spider mite; Wheat: Pseudaletia unipunctata, army worm; Spodoptera frugiperda, fall armyworm; Elasmopalpus lignosellus, lesser cornstalk borer; Agrotis orthogonia, western cutworm; Elasmopalpus lignosellus, lesser cornstalk borer; Oulema melanopus, cereal leaf beetle; Hypera punctata, clover leaf weevil; Diabrotica undecimpunctata howardi, southern corn rootworm; Russian wheat aphid; Schizaphis graminum, greenbug; Macrosiphum avenae, English grain aphid; Melanoplus femurrubrum, redlegged grasshopper; Melanoplus differentialis, differential grasshopper; Melanoplus sanguinipes, migratory grasshopper; Mayetiola destructor, Hessian fly; Sitodiplosis mosellana, wheat midge; Meromyza americana, wheat stem maggot; Hylemya coarctata, wheat bulb fly; Frankliniella fusca, tobacco thrips; Cephus cinctus, wheat stem sawfly; Aceria tulipae, wheat curl mite; Sunflower: Suleima helianthana, sunflower bud moth; Homoeosoma electellum, sunflower moth; zygogramma exclamationis, sunflower beetle; Bothyrus gibbosus, carrot beetle; Neolasioptera murtfeldtiana, sunflower seed midge; Cotton: Heliothis virescens, cotton budworm; Helicoverpa zea, cotton bollworm; Spodoptera exigua, beet armyworm; Pectinophora gossypiella, pink bollworm; Anthonomus grandis grandis, boll weevil; Aphis gossypii, cotton aphid; Pseudatomoscelis seriatus, cotton fleahopper; Trialeurodes abutilonea, bandedwinged whitefly; Lygus lineolaris, tarnished plant bug; Melanoplusfemurrubrum, redlegged grasshopper; Melanoplus differentialis, differential grasshopper; Thrips tabaci, onion thrips; Franklinkiella fusca, tobacco thrips; Tetranychus cinnabarinus, carmine spider mite; Tetranychus urticae, twospotted spider mite; Rice: Diatraea saccharalis, sugarcane borer; Spodoptera frugiperda, fall armyworm; Helicoverpa zea, corn earworm; Colaspis brunnea, grape colaspis; Lissorhoptrus oryzophilus, rice water weevil; Sitophilus oryzae, rice weevil; Nephotettix nigropictus, rice leafhopper; Blissus leucopterus leucopterus, chinch bug; Acrosternum hilare, green stink bug; Soybean: Pseudoplusia includens, soybean looper; Anticarsia gemmatalis, velvetbean caterpillar; Plathypena scabra, green cloverworm; Ostrinia nubilalis, European corn borer; Agrotis ipsilon, black cutworm; Spodoptera exigua, beet armyworm; Heliothis virescens, cotton budworm; Helicoverpa zea, cotton bollworm; Epilachna varivestis, Mexican bean beetle; Myzus persicae, green peach aphid; Empoasca fabae, potato leafhopper; Acrosternum hilare, green stink bug; Melanoplus femurrubrum, redlegged grasshopper; Melanoplus differentialis, differential grasshopper; Hylemya platura, seedcorn maggot; Sericothrips variabilis, soybean thrips; Thrips tabaci, onion thrips; Tetranychus turkestani, strawberry spider mite; Tetranychus urticae, twospotted spider mite; Barley: Ostrinia nubilalis, European corn borer; Agrotis ipsilon, black cutworm; Schizaphis graminum, greenbug; Blissus leucopterus leucopterus, chinch bug; Acrosternum hilare, green stink bug; Euschistus servus, brown stink bug; Delia platura, seedcorn maggot; Mayetiola destructor, Hessian fly; Petrobia latens, brown wheat mite; Oil Seed Rape: Brevicoryne brassicae, cabbage aphid; Phyllotreta cruciferae, Flea beetle; Mamestra configurata, Bertha armyworm; Plutella xylostella, Diamond-back moth; Delia ssp., Root maggots.

[0023] Compositions of the invention include resistance gene homologues (RGHs) that are involved in plant disease resistance. In particular, the present invention provides for isolated nucleic acid molecules comprising nucleotide sequences encoding the amino acid sequences shown in SEQ ID NOS: 2, 5, 7, 9, 11, 13, 15, 17, 19, 23, 26, 29, and 36. Further provided are polypeptides having an amino acid sequence encoded by a nucleic acid molecule described herein, for example those set forth in SEQ ID NOS: 1, 3, 4, 6, 8, 10, 12, 14, 16, 18, 20, 21, 22, 24, 25, 27, 28, 30, 31, 32, 34, and 35, and fragments and variants thereof.

[0024] The naturally occurring disease resistance proteins or partial-length proteins encoded by the disclosed RGH nucleotide sequences, and fragments and variants thereof, are encompassed by the present invention. Where putative introns occur within the disclosed nucleotide sequences (such as in SEQ ID NOS: 3, 21, 24, 27, 30, 31, 32, and 34), compositions of the invention also encompass the mature form of the protein or partial-length protein encoded thereby following intron removal.

[0025] Compositions of the invention include isolated nucleic acid molecules comprising novel RGH sequences isolated from maize, rice, and sorghum. The RGH sequences isolated from maize are partial gene sequences designated as clones M5-1 (SEQ ID NO: 1), M6-1 (SEQ ID NO: 3, which sets forth the M6-1 sequence with its putative 126-bp intron, and SEQ ID NO: 4, which sets forth the M6-1 sequence with the putative intron removed), and M5-6 (SEQ ID NO: 6). These maize RGHs are partial open reading frames encoding polypeptides having the predicted amino acid sequences set forth in SEQ ID NOS: 2, 5, and 7, respectively.

[0026] The RGH sequences isolated from rice are partial gene sequences designated as clones R5-1 (SEQ ID NO: 8), R5-2 (SEQ ID NO: 10), R5-3 (SEQ ID NO: 12), and R5-4 (SEQ ID NO: 14). These RGHs are partial open reading frames encoding polypeptides having the predicted amino acid sequences set forth in SEQ ID NOS: 9, 11, 13, and 15, respectively.

[0027] The RGH sequences isolated from sorghum are partial gene sequences designated as clones S5-5 (SEQ ID NO: 16), S5-2A (SEQ ID NO: 18), S5-2B (SEQ ID NO: 20), S6-1 (SEQ ID NO: 21, which sets forth the S6-1 sequence with its putative 92-bp intron, and SEQ ID NO: 22, which sets forth the S6-1 sequence with the putative intron removed); S6-2 (SEQ ID NO: 24, which sets forth the S6-2 sequence with its putative 100-bp intron, and SEQ ID NO: 25, which sets forth the S6-2 sequence with its putative intron removed); S11-1 (SEQ ID NO: 27, which sets forth the S11-1 sequence with its putative 518-bp intron, and SEQ ID NO: 28, which sets forth the S11-1 sequence with its putative intron removed); S11-25 (SEQ ID NO: 30, which sets forth the S11-25 sequence without removal of a putative intron); S11-27 (SEQ ID NO: 31, which sets forth the S11-27 sequence without removal of a putative intron); and S11-34 (SEQ ID NO: 32, which sets forth the S11-34 sequence without removal of a putative intron). The full-length open reading frame sequence for the clone designated S6-1 and referred to as the S6-1 gene is also provided. The full-length open reading frame for the S6-1 gene is set forth as SEQ ID NO: 34 (which includes the putative 92-bp intron) and SEQ ID NO: 35 (which shows the S6-1 sequence with the putative intron removed).

[0028] The sorghum clones designated S5-5, S5-2A, S6-1, S6-2, and S11-1 encode polypeptides having the predicted amino acid sequences set forth in SEQ ID NOS: 17, 19, 23, 26, and 29, respectively. The sorghum clone designated S5-2B encodes a polypeptide that comprises the amino acid sequence set forth in SEQ ID NO: 46, which represents that portion of the polypeptide comprising a kinase-2 domain characteristic of products of R genes in the NBS-LRR superfamily (see FIGS. 1A-1B, and the sequence referred to as S0545). The full-length open reading frame of the S6-1 gene encodes a protein having a predicted amino acid sequence set forth in SEQ ID NO: 36.

[0029] The nucleotide sequences of the invention and the amino acid sequences encoded thereby, as well as fragments and variants thereof, are hereinafter referred to as RGH nucleotide sequences and RGH proteins, respectively. Thus, the term RGH protein encompasses the disclosed full-length and partial-length proteins encoded by the RGH nucleotide sequences disclosed herein.

[0030] The invention encompasses isolated or substantially purified nucleic acid or protein compositions. An “isolated” or “purified” nucleic acid molecule or protein, or biologically active portion thereof, is substantially free of other cellular material, or culture medium when produced by recombinant techniques, or substantially free of chemical precursors or other chemicals when chemically synthesized. Preferably, an “isolated” nucleic acid is free of sequences (preferably protein encoding sequences) that naturally flank the nucleic acid (i.e., sequences located at the 5′ and 3′ ends of the nucleic acid) in the genomic DNA of the organism from which the nucleic acid is derived. For example, in various embodiments, the isolated nucleic acid molecule can contain less than about 5 kb, 4 kb, 3 kb, 2 kb, 1 kb, 0.5 kb, or 0.1 kb of nucleotide sequences that naturally flank the nucleic acid molecule in genomic DNA of the cell from which the nucleic acid is derived. A protein that is substantially free of cellular material includes preparations of protein or polypeptide having less than about 30%, 20%, 10%, 5%, (by dry weight) of contaminating protein. When the protein of the invention, or biologically active portion thereof, is recombinantly produced, preferably culture medium represents less than about 30%, 20%, 10%, or 5% (by dry weight) of chemical precursors or non-protein-of-interest chemicals.

[0031] Fragments and variants of the disclosed nucleotide sequences and proteins or partial-length proteins encoded thereby are also encompassed by the present invention. By “fragment” is intended a portion of the nucleotide sequence or a portion of the amino acid sequence, and hence a portion of the polypeptide or protein, encoded thereby. Fragments of a nucleotide sequence may encode protein fragments that retain the biological activity of the native RGH and hence confer disease resistance to a plant by interacting with the complementing phytopathogen avirulence gene product released into the plant by the invading plant pathogen. Alternatively, fragments of a nucleotide sequence that are useful as hybridization probes generally do not encode fragment proteins retaining biological activity. Thus, fragments of a nucleotide sequence may range from at least about 20 nucleotides, about 50 nucleotides, about 100 nucleotides, and up to the full-length nucleotide sequence encoding the RGH proteins of the invention.

[0032] A fragment of an RGH nucleotide sequence that encodes a biologically active portion of an RGH protein of the invention will encode at least 15, 20, 25, 30, 50, 100, 150, 200, or 250 contiguous amino acids, or up to the total number of amino acids present in a full-length RGH protein of the invention. Fragments of an RGH nucleotide sequence that are useful as hybridization probes for PCR primers generally need not encode a biologically active portion of an RGH protein.

[0033] A fragment of an RGH nucleotide sequence may encode a biologically active portion of an RGH protein, or it may be a fragment that can be used as a hybridization probe or PCR primer using methods disclosed below. A biologically active portion of an RGH protein can be prepared by isolating a portion of one of the RGH nucleotide sequences of the invention, expressing the encoded portion of the RGH protein (e.g., by recombinant expression in vitro), and assessing the activity of the encoded portion of the RGH protein. Nucleic acid molecules that are fragments of an RGH nucleotide sequence comprise at least 16, 20, 50, 75, 100, 150, 200, 250, 300, 350, 400, 450, 500, 600, 700, 800, 900, 1000, 1250, 1500, 1750, 2000, 2250, 2500, 2750, 2800, 2850, 2900, or 2950 nucleotides, or up to the number of nucleotides present in a full-length RGH nucleotide sequence disclosed herein (for example, 517, 634, 508, 498, 515, 506, 518, 510, 506, 505, 514, 609, 517, 605, 505, 1040, 522, 1044, 1038, 1043, 2954, or 2862 nucleotides for SEQ ID NOS: 1, 3, 4, 6, 8, 10, 12, 14, 16, 18, 20, 21, 22, 24, 25, 27, 28, 30, 31, 32, 34, and 35, respectively).

[0034] By “variants” is intended substantially similar sequences. For nucleotide sequences, conservative variants include those sequences that, because of the degeneracy of the genetic code, encode the amino acid sequence of one of the disease resistance polypeptides of the invention. Naturally occurring allelic variants such as these can be identified with the use of well-known molecular biology techniques, as, for example, with polymerase chain reaction (PCR) and hybridization techniques as outlined below. Variant nucleotide sequences also include synthetically derived nucleotide sequences, such as those generated, for example, by using site-directed mutagenesis but which still encode an RGH protein of the invention. Generally, nucleotide sequence variants of the invention will have at least 40, 50, 60, to 70%, generally, 80%, preferably 85%, 90%, up to 95%, 98% sequence identity to the native nucleotide sequence.

[0035] By “variant” protein is intended a protein derived from the native protein by deletion (so-called truncation) or addition of one or more amino acids to the N-terminal and/or C-terminal end of the native protein; deletion or addition of one or more amino acids at one or more sites in the native protein; or substitution of one or more amino acids at one or more sites in the native protein. Such variants may result from, for example, genetic polymorphism or from human manipulation. Methods for such manipulations are generally known in the art.

[0036] The proteins of the invention may be altered in various ways including amino acid substitutions, deletions, truncations, and insertions. Methods for such manipulations are generally known in the art. For example, amino acid sequence variants of the RGH proteins can be prepared by mutations in the DNA. Methods for mutagenesis and nucleotide sequence alterations are well known in the art. See, for example, Kunkel (1985) Proc. Natl. Acad. Sci. USA 82:488-492; Kunkel et al. (1987) Methods in Enzymol. 154:367-382; U.S. Pat. No. 4,873,192; Walker and Gaastra, eds. (1983) Techniques in Molecular Biology (MacMillan Publishing Company, New York) and the references cited therein. Guidance as to appropriate amino acid substitutions that do not affect biological activity of the protein of interest may be found in the model of Dayhoff et al. (1978) Atlas of Protein Sequence and Structure (Natl. Biomed. Res. Found., Washington, D.C.), herein incorporated by reference. Conservative substitutions, such as exchanging one amino acid with another having similar properties, may be preferred.

[0037] Thus, the genes and nucleotide sequences of the invention include both the naturally occurring sequences as well as mutant forms. Likewise, the proteins of the invention encompass both naturally occurring proteins as well as variations and modified forms thereof. Such variants will continue to possess the desired disease resistance activity. Obviously, the mutations that will be made in the DNA encoding the variant must not place the sequence out of reading frame and preferably will not create complementary regions that could produce secondary mRNA structure. See, EP Patent Application Publication No. 75,444.

[0038] The deletions, insertions, and substitutions of the protein sequence encompassed herein are not expected to produce radical changes in the characteristics of the protein. However, when it is difficult to predict the exact effect of the substitution, deletion, or insertion in advance of doing so, one skilled in the art will appreciate that the effect will be evaluated by routine screening assays. That is, the activity can be evaluated by monitoring for enhanced disease resistance.

[0039] The resistance gene homologues of the invention can be optimized for enhanced expression in plants of interest. See, for example, EPA0359472; WO91/16432; Perlak et al. (1991) Proc. Natl. Acad. Sci USA 88:3324-3328; and Murray et al. (1989) Nucleic Acids Res. 17:477-498. In this manner, the genes or gene fragments can be synthesized utilizing plant-preferred codons. See, for example, Murray et al. (1989) Nucleic Acids Res. 17:477-498, the disclosure of which is incorporated herein by reference. In this manner, synthetic genes can also be made based on the distribution of codons a particular host uses for a particular amino acid. See, for example, Campbell and Gowri (1990) Plant Physiol. 92:1 -11 for a discussion of host-preferred codon usage. Thus, the nucleotide sequences can be optimized for expression in any plant. It is recognized that all or any part of the gene sequence may

[0040] be optimized or synthetic. That is, synthetic or partially optimized sequences may also be used.

[0041] Variant nucleotide sequences and proteins also encompass sequences and proteins derived from a mutagenic and recombinogenic procedure such as DNA shuffling. With such a procedure, one or more different coding sequences can be manipulated to create a new protein possessing the desired properties. In this manner, libraries of recombinant polynucleotides are generated from a population of related sequence polynucleotides comprising sequence regions that have substantial sequence identity and can be homologously recombined in vitro or in vivo. For example, using this approach, sequence motifs encoding a domain of interest may be shuffled between the resistance gene homologues of the invention and other known disease resistance genes to obtain a new gene coding for a disease resistance protein with an improved property of interest, such as an improved interaction with its complementing phytopathogen avirulence gene product, which in turn enhances disease resistance. Strategies for such DNA shuffling are known in the art. See, for example, Stemmer (1994) Proc. Natl. Acad. Sci. USA 91:10747-10751; Stemmer (1994) Nature 370:389-391; Crameri et al. (1997) Nature Biotech. 15:436-438; Moore et al. (1997) J. Mol. Biol. 272:336-347; Zhang et al. (1997) Proc. Natl. Acad. Sci. USA 94:4504-4509; Crameri et al. (1998) Nature 391:288-291; and U.S. Pat. Nos. 5,605,793 and 5,837,458.

[0042] Other resistance genes well known in the art may be used in such a DNA shuffling approach. See, for example, Dixon et al. (1996) Cell 84(3):451-459; Reuber et al. (1996) Plant Cell 8(2):241-249; Grant et al. (1995) Science 269(5225):843-846; Bisgrove et al. (1994) Plant Cell 6(7):927-933; Dangl et al. (1992) Plant Cell 4(11):1359-1369; Ashfield et al. (1995) Genetics 141(4):1597-1604; Kunkel et al. (1993) Plant Cell 5(8):865-875; Jones et al. (1994) Science 266(5186):789-793; Mindrinos (1994) Cell 78(6):1089-1099; Bent et al. (1994) Science 265(5180):1856-1860; Dixon et al. (1995) Mol. Plant Microbe Interact. 8(2):200-206; Salmeron et al. (1996) Cell 86(1):123-133; Rommens et al. (1995) Plant Cell 7:1537-1544; Buschges et al. (1997) Cell 88(5):695-705; Song et al. (1995) Science 270(5243):1804-1806; Loh et al. (1995) Proc. Natl. Acad. Sci. USA 92(10):4181-4184; Tornero et al. (1996) Plant J. 10(2):315-330; Staskawicz et al. (1995) Science 268(5211):661-667; Whitham et al. (1994) Cell 78(6):1101-1115; Dickinson et al. (1993) Mol. Plant Microbe Interact. 6(3):341-347; Innis et al. (1993) Plant J 4:813-820; Leister et al. (1996) Proc. Natl. Acad. Sci. USA 93(26):15497-15502; Kanazin et al. (1996) Proc. Natl. Acad. Sci. USA 93(21):11746-11750; and Hammond-Kosack et al. (1996) Plant Cell 8(10):1773-1791; the disclosures of which are herein incorporated by reference.

[0043] Domain swapping allows for the generation of new resistance specificities. Such newly synthesized resistance genes can be designed for detection of a broad range of pathogen genotypes.

[0044] Thus nucleotide sequences of the invention and the proteins or partial-length proteins encoded thereby include the naturally occurring forms as well as variants and fragments thereof. The nucleotide sequences encoding the disease resistance proteins or partial-length proteins of the present invention can be the naturally occurring sequences or they may be synthetically derived sequences. Alternatively, the nucleotide sequences for the disease resistance gene homologues of the present invention can be utilized to isolate homologous disease resistance genes from other plants, including Arabidopsis, sorghum, Brassica, wheat, tobacco, cotton, tomato, barley, sunflower, cucumber, alfalfa, soybeans, sorghum, etc. In this manner, methods such as PCR, hybridization, and the like can be used to identify such sequences based on their sequence homology to the sequences set forth herein. Sequences isolated based on their sequence identity to the entire RGH sequences set forth herein or to fragments thereof are encompassed by the present invention.

[0045] In a PCR approach, oligonucleotide primers can be designed for use in PCR reactions to amplify corresponding DNA sequences from cDNA or genomic DNA extracted from any plant of interest. Methods for designing PCR primers and PCR cloning are generally known in the art and are disclosed in Sambrook et al. (1989) Molecular Cloning: A Laboratory Manual (2d ed., Cold Spring Harbor Laboratory Press, Plainview, N.Y.). See also Innis et al., eds. (1990) PCR Protocols: A Guide to Methods and Applications (Academic Press, New York); Innis and Gelfand, eds. (1995) PCR Strategies (Academic Press, New York); and Innis and Gelfand, eds. (1999) PCR Methods Manual (Academic Press, New York). Known methods of PCR include, but are not limited to, methods using paired primers, nested primers, single specific primers, degenerate primers, gene-specific primers, vector-specific primers, partially-mismatched primers, and the like.

[0046] In hybridization techniques, all or part of a known nucleotide sequence is used as a probe that selectively hybridizes to other corresponding nucleotide sequences present in a population of cloned genomic DNA fragments or cDNA fragments (i.e., genomic or cDNA libraries) from a chosen organism. The hybridization probes may be genomic DNA fragments, cDNA fragments, RNA fragments, or other oligonucleotides, and may be labeled with a detectable group such as ³²P, or any other detectable marker. Thus, for example, probes for hybridization can be made by labeling synthetic oligonucleotides based on the RGH sequence of the invention. Methods for preparation of probes for hybridization and for construction of cDNA and genomic libraries are generally known in the art and are disclosed in Sambrook et al. (1989) Molecular Cloning: A Laboratory Manual (2d ed., Cold Spring Harbor Laboratory Press, Plainview, N.Y.).

[0047] For example, the entire RGH sequence disclosed herein, or one or more portions thereof, may be used as a probe capable of specifically hybridizing to corresponding disease resistance gene sequences and messenger RNAs. To achieve specific hybridization under a variety of conditions, such probes include sequences that are unique among disease resistance gene sequences and are preferably at least about 10 nucleotides in length, and most preferably at least about 20 nucleotides in length. Such probes may be used to amplify corresponding disease resistance gene sequences from a chosen organism by PCR. This technique may be used to isolate additional coding sequences from a desired organism or as a diagnostic assay to determine the presence of coding sequences in an organism. Hybridization techniques include hybridization screening of plated DNA libraries (either plaques or colonies; see, for example, Sambrook et al. (1989) Molecular Cloning: A Laboratory Manual (2d ed., Cold Spring Harbor Laboratory Press, Plainview, N.Y.).

[0048] Hybridization of such sequences may be carried out under stringent conditions. By “stringent conditions” or “stringent hybridization conditions” is intended conditions under which a probe will hybridize to its target sequence to a detectably greater degree than to other sequences (e.g., at least 2-fold over background). Stringent conditions are sequence-dependent and will be different in different circumstances. By controlling the stringency of the hybridization and/or washing conditions, target sequences that are 100% complementary to the probe can be identified (homologous probing). Alternatively, stringency conditions can be adjusted to allow some mismatching in sequences so that lower degrees of similarity are detected (heterologous probing). Generally, a probe is less than about 1000 nucleotides in length, preferably less than 500 nucleotides in length.

[0049] Typically, stringent conditions will be those in which the salt concentration is less than about 1.5 M Na ion, typically about 0.01 to 1.0 M Na ion concentration (or other salts) at pH 7.0 to 8.3 and the temperature is at least about 30° C. for short probes (e.g., 10 to 50 nucleotides) and at least about 60° C. for long probes (e.g., greater than 50 nucleotides). Stringent conditions may also be achieved with the addition of destabilizing agents such as formamide. Exemplary low stringency conditions include hybridization with a buffer solution of 30 to 35% formamide, 1 M NaCl, 1% SDS (sodium dodecyl sulphate) at 37° C., and a wash in 1× to 2×SSC (20×SSC=3.0 M NaCl/0.3 M trisodium citrate) at 50 to 55° C. Exemplary moderate stringency conditions include hybridization in 40 to 45% formamide, 1.0 M NaCl, 1% SDS at 37° C., and a wash in 0.5× to 1×SSC at 55 to 60° C. Exemplary high stringency conditions include hybridization in 50% formamide, 1 M NaCl, 1% SDS at 37° C, and a wash in 0.1×SSC at 60 to 65° C.

[0050] Specificity is typically the function of post-hybridization washes, the critical factors being the ionic strength and temperature of the final wash solution. For DNA-DNA hybrids, the T_(m) can be approximated from the equation of Meinkoth and Wahl (1984) Anal. Biochem. 138:267-284: T_(m)=81.5° C. +16.6 (log M)+0.41 (% GC)−0.61 (% form)−500/L; where M is the molarity of monovalent cations, % GC is the percentage of guanosine and cytosine nucleotides in the DNA, % form is the percentage of formamide in the hybridization solution, and L is the length of the hybrid in base pairs. The T_(m) is the temperature (under defined ionic strength and pH) at which 50% of a complementary target sequence hybridizes to a perfectly matched probe. T_(m) is reduced by about 1EC for each 1% of mismatching; thus, T_(m), hybridization, and/or wash conditions can be adjusted to hybridize to sequences of the desired identity. For example, if sequences with ≧90% identity are sought, the T_(m) can be decreased 10° C. Generally, stringent conditions are selected to be about 5° C. lower than the thermal melting point (T_(m)) for the specific sequence and its complement at a defined ionic strength and pH. However, severely stringent conditions can utilize a hybridization and/or wash at 1, 2, 3, or 4° C. lower than the thermal melting point (T_(m)); moderately stringent conditions can utilize a hybridization and/or wash at 6, 7, 8, 9, or 10° C. lower than the thermal melting point (T_(m)); low stringency conditions can utilize a hybridization and/or wash at 11, 12, 13, 14, 15, or 20° C. lower than the thermal melting point (T_(m)). Using the equation, hybridization and wash compositions, and desired T_(m), those of ordinary skill will understand that variations in the stringency of hybridization and/or wash solutions are inherently described. If the desired degree of mismatching results in a T_(m) of less than 45° C. (aqueous solution) or 32° C. (formamide solution), it is preferred to increase the SSC concentration so that a higher temperature can be used. An extensive guide to the hybridization of nucleic acids is found in Tijssen (1993) Laboratory Techniques in Biochemistry and Molecular Biology—Hybridization with Nucleic Acid Probes, Part I, Chapter 2 (Elsevier, N.Y.); and Ausubel et al., eds. (1995) Current Protocols in Molecular Biology, Chapter 2 (Greene Publishing and Wiley-Interscience, New York). See Sambrook et al. (1989) Molecular Cloning: A Laboratory Manual (2d ed., Cold Spring Harbor Laboratory Press, Plainview, N.Y.). In general, sequences that encode a disease resistance protein and hybridize to the RGH sequences disclosed herein will be at least 40% to 50% homologous, about 60% to 70% homologous, and even about 80%, 85%, 90%, 95% to 98% homologous or more with the disclosed RGH sequence. That is, the sequence similarity of sequences may range, sharing at least about 40% to 50%, about 60% to 70%, and even about 80%, 85%, 90%, 95% to 98% sequence similarity.

[0051] Generally, since leader peptides are not highly conserved between monocots and dicots, sequences can be utilized from the carboxyterminal end of the protein as probes for the isolation of corresponding sequences from any plant. Nucleotide probes can be constructed and utilized in hybridization experiments as discussed above. In this manner, even gene sequences that are divergent in the aminoterminal region can be identified and isolated for use in the methods of the invention.

[0052] Thus the disclosed RGH nucleotide sequences or portions thereof can be used as probes for identifying nucleotide sequences for similar disease resistance genes in a chosen plant or organism. Once similar genes are identified, their respective nucleotide sequences can be utilized in the present invention to enhance disease resistance in a plant.

[0053] The following terms are used to describe the sequence relationships between two or more nucleic acids or polynucleotides: (a) “reference sequence”, (b) “comparison window”, (c) “sequence identity”, (d) “percentage of sequence identity”, and (e) “substantial identity”.

[0054] (a) As used herein, “reference sequence” is a defined sequence used as a basis for sequence comparison. A reference sequence may be a subset or the entirety of a specified sequence; for example, as a segment of a full-length cDNA or gene sequence, or the complete cDNA or gene sequence.

[0055] (b) As used herein, “comparison window” makes reference to a contiguous and specified segment of a polynucleotide sequence, wherein the polynucleotide sequence in the comparison window may comprise additions or deletions (i.e., gaps) compared to the reference sequence (which does not comprise additions or deletions) for optimal alignment of the two sequences. Generally, the comparison window is at least 20 contiguous nucleotides in length, and optionally can be 30, 40, 50, 100, or longer. Those of skill in the art understand that to avoid a high similarity to a reference sequence due to inclusion of gaps in the polynucleotide sequence a gap penalty is typically introduced and is subtracted from the number of matches.

[0056] Methods of alignment of sequences for comparison are well known in the art. Optimal alignment of sequences for comparison may be conducted by the local homology algorithm of Smith et al. (1981) Adv. Appl. Math. 2:482; by the homology alignment algorithm of Needleman et al. (1970) J. Mol. Biol. 48:443; by the search for similarity method of Pearson et al. (1988) Proc. Natl. Acad. Sci. 85:2444; by computerized implementations of these algorithms, including, but not limited to: CLUSTAL in the PC/Gene program by Intelligenetics, Mountain View, Calif.; GAP, BESTFIT, BLAST, FASTA, and TFASTA in the Wisconsin Genetics Software Package, Genetics Computer Group (GCG), 575 Science Drive, Madison, Wis., USA; the CLUSTAL program is well described by Higgins et al. (1988) Gene 73:237-244 (1988); Higgins et al. (1989) CABIOS 5:151-153; Corpet et al. (1988) Nucleic Acids Res. 16:10881-90; Huang et al. (1992) Computer Applications in the Biosciences 8:155-65, and Person et al. (1994) Meth. Mol. Biol. 24:307-331; preferred computer alignment methods also include the BLASTP, BLASTN, and BLASTX algorithms (see Altschul et al. (1990) J. Mol. Biol. 215:403-410). Alignment is also often performed by inspection and manual alignment.

[0057] (c) As used herein, “sequence identity” or “identity” in the context of two nucleic acid or polypeptide sequences makes reference to the residues in the two sequences that are the same when aligned for maximum correspondence over a specified comparison window. When percentage of sequence identity is used in reference to proteins it is recognized that residue positions which are not identical often differ by conservative amino acid substitutions, where amino acid residues are substituted for other amino acid residues with similar chemical properties (e.g., charge or hydrophobicity) and therefore do not change the functional properties of the molecule. When sequences differ in conservative substitutions, the percent sequence identity may be adjusted upwards to correct for the conservative nature of the substitution. Sequences that differ by such conservative substitutions are said to have “sequence similarity” or “similarity”. Means for making this adjustment are well known to those of skill in the art. Typically this involves scoring a conservative substitution as a partial rather than a full mismatch, thereby increasing the percentage sequence identity. Thus, for example, where an identical amino acid is given a score of 1 and a non-conservative substitution is given a score of zero, a conservative substitution is given a score between zero and 1. The scoring of conservative substitutions is calculated, e.g., as implemented in the program PC/GENE (Intelligenetics, Mountain View, Calif.).

[0058] (d) As used herein, “percentage of sequence identity” means the value determined by comparing two optimally aligned sequences over a comparison window, wherein the portion of the polynucleotide sequence in the comparison window may comprise additions or deletions (i.e., gaps) as compared to the reference sequence (which does not comprise additions or deletions) for optimal alignment of the two sequences. The percentage is calculated by determining the number of positions at which the identical nucleic acid base or amino acid residue occurs in both sequences to yield the number of matched positions, dividing the number of matched positions by the total number of positions in the window of comparison, and multiplying the result by 100 to yield the percentage of sequence identity.

[0059] (e)(i) The term “substantial identity” of polynucleotide sequences means that a polynucleotide comprises a sequence that has at least 70% sequence identity, preferably at least 80%, more preferably at least 90%, and most preferably at least 95%, compared to a reference sequence using one of the alignment programs described using standard parameters. One of skill in the art will recognize that these values can be appropriately adjusted to determine corresponding identity of proteins encoded by two nucleotide sequences by taking into account codon degeneracy, amino acid similarity, reading frame positioning, and the like. Substantial identity of amino acid sequences for these purposes normally means sequence identity of at least 60%, more preferably at least 70%, 80%, 90%, and most preferably at least 95%.

[0060] Another indication that nucleotide sequences are substantially identical is if two molecules hybridize to each other under stringent conditions. Generally, stringent conditions are selected to be about 5° C. lower than the thermal melting point (T_(m)) for the specific sequence at a defined ionic strength and pH. However, stringent conditions encompass temperatures in the range of about 1° C. to about 20° C., depending upon the desired degree of stringency as otherwise qualified herein. Nucleic acids that do not hybridize to each other under stringent conditions are still substantially identical if the polypeptides they encode are substantially identical. This may occur, e.g., when a copy of a nucleic acid is created using the maximum codon degeneracy permitted by the genetic code. One indication that two nucleic acid sequences are substantially identical is when the polypeptide encoded by the first nucleic acid is immunologically cross reactive with the polypeptide encoded by the second nucleic acid.

[0061] (e)(ii) The term “substantial identity” in the context of a peptide indicates that a peptide comprises a sequence with at least 70% sequence identity to a reference sequence, preferably 80%, more preferably 85%, most preferably at least 90% or 95% sequence identity to the reference sequence over a specified comparison window. Preferably, optimal alignment is conducted using the homology alignment algorithm of Needleman et al. (1970) J. Mol. Biol. 48:443. An indication that two peptide sequences are substantially identical is that one peptide is immunologically reactive with antibodies raised against the second peptide. Thus, a peptide is substantially identical to a second peptide, for example, where the two peptides differ only by a conservative substitution. Peptides that are “substantially similar” share sequences as noted above except that residue positions that are not identical may differ by conservative amino acid changes.

[0062] The RGHs of the present invention, fragments and variants thereof, and any similar sequences identified in other organisms or new resistance gene sequences synthesized by DNA shuffling can be utilized to enhance disease resistance in a plant.

[0063] Methods of the invention involve stably transforming a plant with one or more of these novel disease resistance gene homologue nucleotide sequences operably linked with a promoter capable of driving expression of a gene in a plant cell. Expression of the novel disease resistance gene homologues confers disease resistance to a plant by interacting with the complementing phytopathogen avirulence gene product released into the plant by the invading plant pathogen. The plant to be transformed may or may not have preexisting disease resistance genes present in its genome. If so, transformation with one of these novel disease resistance gene homologues further enhances disease resistance of the transformed plant to include resistance to pathogens carrying the complementing avirulence gene.

[0064] The plant undergoing transformation with the RGH of the present invention may additionally be transformed with its complementing avr gene operably linked to regulatory regions. The expression of the two genes in the plant cell induces the disease resistance pathway or induces immunity in the plant. That is, the expression of the genes can induce a defense response in the cell or can turn on the disease resistance pathway to obtain cell death. The end result can be controlled by the level of expression of the avr gene in the plant. Where the expression is sufficient to cause cell death, such response is a receptor-mediated programmed response. See, for example, Ryerson and Heath (1996) Plant Cell 8:393-402 and Dangl et al. (1996) Plant Cell 8:1793-1807.

[0065] The nucleotide sequences for the disease resistance gene homologues of the present invention are useful in the genetic manipulation of any plant when operably linked to a promoter that is functional within the plant. In this manner, the nucleotide sequences of the invention are provided in expression cassettes for expression in the plant of interest.

[0066] Such expression cassettes will include 5′ and 3′ regulatory sequences operably linked to an RGH sequence of the invention. By “operably linked” is intended a functional linkage between a promoter and a second sequence, wherein the promoter sequence initiates and mediates transcription of the DNA sequence corresponding to the second sequence. Generally, operably linked means that the nucleic acid sequences being linked are contiguous and, where necessary to join two protein coding regions, contiguous and in the same reading frame. The cassette may additionally contain at least one additional gene to be cotransformed into the organism. Alternatively, the additional gene(s) can be provided on multiple expression cassettes.

[0067] For example, in one embodiment of the invention, the expression cassette may additionally comprise the complementing avr gene for the resistance gene of the present invention operably linked to regulatory regions functional within the plant undergoing transformation. Alternatively, the complementing avr gene may be provided on another expression cassette. Preferably expression of the avr gene would be regulated by an inducible promoter, more preferably a pathogen-inducible promoter. In this manner, invasion of the plant by a nonspecific pathogen triggers expression of the avr gene. The avr gene product would then interact with the product of the introduced complementing resistance gene, whose expression may be under the control of a constitutive or inducible promoter. This specific recognition event would activate a cascade of plant resistance-related genes, leading to a hypersensitive response in the invaded cells and inhibition of further spread of the pathogen beyond the site of initial infection. Extent of the disease resistance response could be manipulated by altering expression of the avr gene via its promoter sequence, as disclosed in the co-pending application entitled “Methods for Enhancing Disease Resistance in Plants,” U.S. Patent Application Serial No. 60/076,151, filed Feb. 26, 1998, herein incorporated by reference.

[0068] Such an expression cassette is provided with a plurality of restriction sites for insertion of the RGH sequence to be under the transcriptional regulation of the regulatory regions. The expression cassette may additionally contain selectable marker genes.

[0069] The expression cassette will include in the 5′-3′ direction of transcription, a transcriptional and translational initiation region, an RGH sequence of the invention, and a transcriptional and translational termination region functional in plants. The transcriptional initiation region, the promoter, may be native or analogous or foreign or heterologous to the plant host. Additionally, the promoter may be the natural sequence or alternatively a synthetic sequence. By “foreign” is intended that the transcriptional initiation region is not found in the native plant into which the transcriptional initiation region is introduced or alternatively is found after transformation at a different site in the genome. As used herein, a chimeric gene comprises a coding sequence operably linked to a transcription initiation region that is heterologous to the coding sequence.

[0070] While it may be preferable to express the sequences using heterologous promoters, the native promoter sequences may be used. Such constructs would change expression levels of the RGH protein in the plant or plant cell. Thus, the phenotype of the plant or plant cell is altered.

[0071] A number of promoters can be used in the practice of the invention, including constitutive, pathogen-inducible, wound-inducible, and tissue-specific promoters. In the event that continuous resistance to a pathogen carrying the complementing avr gene is desirable, a constitutive promoter is preferable. Such constitutive promoters include, for example, the core promoter of the Rsyn7 (copending U.S. application Ser. No. 08/661,601); the core CaMV 35S promoter (Odell et al. (1985) Nature 313:810-812); rice actin (McElroy et al (1990) Plant Cell 2:163-171); ubiquitin (Christensen et al. (1989) Plant Mol. Biol. 12:619-632 and Christensen et al. (1992) Plant Mol. Biol. 18:675-689); pEMU (Last et al. (1991) Theor. Appl. Genet. 81:581-588); MAS (Velten et al. (1984) EMBO J. 3:2723-2730); ALS promoter (U.S. application Ser. No. 08/409,297), and the like. Other constitutive promoters include, for example, U.S. Pat. Nos. 5,608,149; 5,608,144; 5,604,121; 5,569,597; 5,466,785; 5,399,680; 5,268,463; and 5,608,142; and the co-pending application entitled “Constitutive Maize Promoters,” U.S. patent application Ser. No. 09/257,584, filed Feb. 25, 1999, herein incorporated by reference.

[0072] Alternatively, it may be desirable to have the introduced RGH sequence expressed upon pathogen invasion, wherein expression of the resistance gene results in the plant being primed in the event of invasion by the pathogen carrying the complementing avr gene. Such pathogen-inducible promoters include those from pathogenesis-related proteins (PR proteins), which are induced following infection by a pathogen; e.g., PR proteins, SAR proteins, beta-1,3-glucanase, chitinase, etc. See, for example, Redolfi et al (1983) Neth. J. Plant Pathol. 89:245-254; Uknes et al. (1992) The Plant Cell 4:645-656; and Van Loon (1985) Plant Mol. Virol. 4:111-116. See also the co-pending application entitled “Inducible Maize Promoters”, U.S. patent application Ser. No. 09/257,583, filed Feb. 25, 1999, and herein incorporated by reference.

[0073] Of interest are promoters that are expressed locally at or near the site of pathogen infection. See, for example, Marineau et al. (1987) Plant Mol. Biol. 9:335-342; Matton et al. (1989) Molecular Plant-Microbe Interactions 2:325-331; Somsisch et al. (1986) Proc. Natl. Acad. Sci. USA 83:2427-2430; Sommsich et al. (1988) Mol. Gen. Genet.2:93-98; and Yang, Y (1996) Proc. Natl. Acad. Sci. USA 93:14972-14977. See also, Chen et al. (1996) Plant J. 10:955-966; Zhang and Sing (1994) Proc. Natl. Acad. Sci. USA 91:2507-2511; Warner et al. (1993) Plant J. 3:191-201; Siebertz et al. (1989) Plant Cell 1:961-968; and the references cited therein. Of particular interest is the inducible promoter for the maize PRms gene, whose expression is induced by the pathogen Fusarium moniliforme (see, for example, Cordero et al. (1992) Physiol.

[0074] Additionally, as pathogens find entry into plants through wounds or insect damage, a wound-inducible promoter may be used in the constructions of the invention. Such wound-inducible promoters include potato proteinase inhibitor (pin II) gene (Ryan (1990) Ann. Rev. Phytopath. 28:425-449; Duan et al. (1996) Nature Biotechnology 14:494-498); wun1 and wun2, U.S. Pat. No. 5,428,148; win1 and win2 (Stanford et al. (1989) Mol. Gen. Genet. 215:200-208); systemin (McGurl et al. (1992) Science 225:1570-1573); WIP1 (Rohmeier et al. (1993) Plant Mol. Biol. 22:783-792; Eckelkamp et al. (1993) FEBS Letters 323:73-76); MPI gene (Corderok et al. (1994) Plant J. 6(2):141-150); and the like, herein incorporated by reference.

[0075] Where expression of the introduced gene is preferred within a particular tissue that is susceptible to attack by the pathogen carrying the complementing avr gene, a tissue-specific promoter may be desirable. Tissue specific promoters include Yamamoto et al. (1997) Plant J. 12(2)255-265; Kawamata et al. (1997) Plant Cell Physiol. 38(7):792-803; Hansen et al. (1997) Mol Gen Genet. 254(3):337-343; Russell et al. (1997) Transgenic Res. 6(2):157-168; Rinehart et al. (1996) Plant Physiol. 112(3):1331-1341; Van Camp et al. (1996) Plant Physiol. 112(2):525-535; Canevascini et al. (1996) Plant Physiol. 112(2):513-524; Yamamoto et al. (1994) Plant Cell Physiol. 35(5):773-778; Lam (1994) Results Probl Cell Differ. 20:181-196; Orozco et al. (1993) Plant Mol Biol. 23(6):1129-1138; Matsuoka et al. (1993) Proc Natl Acad Sci U S A 90(20):9586-9590; and Guevara-Garcia et al. (1993) Plant J. 4(3):495-505.

[0076] Any of these promoters can be modified, if necessary, for weak expression. Such weak promoters cause background levels of the disease resistance protein to be expressed. Generally, by “weak promoter” is intended either a promoter that drives expression of a coding sequence at a low level. By low level is intended at levels of about {fraction (1/1000)} transcripts to about {fraction (1/100,000)} transcripts to about {fraction (1/500,000)} transcripts. Alternatively, it is recognized that weak promoters also encompasses promoters that are expressed in only a few cells and not in others to give a total low level of expression. Where a promoter is expressed at unacceptably high levels, portions of the promoter sequence can be deleted or modified to decrease expression levels. Such weak constitutive promoters include, for example, the core promoter of the Rsyn7 (copending application Ser. No. 08/661,601), the core 35S CaMV promoter, and the like.

[0077] Thus the expression cassette will include in the 5′-3′ direction of transcription, a transcriptional and translational initiation region, a nucleotide sequence encoding the particular disease resistance protein of the present invention, and a transcriptional and translational termination region functional in plants. The termination region may be native with the transcriptional initiation region, may be native with the DNA sequence of interest, or may be derived from another source. Convenient termination regions are available from the Ti-plasmid of A. tumefaciens, such as the octopine synthase and nopaline synthase termination regions. See also, Guerineau et al. (1991) Mol. Gen. Genet. 262:141-144; Proudfoot (1991) Cell 64:671-674; Sanfacon et al. (1991) Genes Dev. 5:141-149; Mogen et al. (1990) Plant Cell 2:1261-1272; Munroe et al. (1990) Gene 91:151-158; Ballas et al. 1989) Nucleic Acids Res. 17:7891-7903; Joshi et al. (1987) Nucleic Acid Res. 15:9627-9639.

[0078] Where appropriate, the RGH sequence and any additional gene(s) may be optimized for increased expression in the transformed plant. That is, these nucleotide sequences can be synthesized using plant-preferred codons for improved expression. Methods are available in the art for synthesizing plant-preferred genes. See, for example, U.S. Pat. Nos. 5,380,831, 5,436, 391, and Murray et al. (1989) Nucleic Acids Res. 17:477-498, herein incorporated by reference.

[0079] Additional sequence modifications are known to enhance gene expression in a cellular host. These include elimination of sequences encoding spurious polyadenylation signals, exon-intron splice site signals, transposon-like repeats, and other such well-characterized sequences which may be deleterious to gene expression. The G-C content of the sequence may be adjusted to levels average for a given cellular host, as calculated by reference to known genes expressed in the host cell. When possible, the sequence is modified to avoid predicted hairpin secondary mRNA structures.

[0080] The expression cassettes may additionally contain 5′ leader sequences in the expression cassette construct. Such leader sequences can act to enhance translation. Translation leaders are known in the art and include: picomavirus leaders, for example, EMCV leader (Encephalomyocarditis 5′ noncoding region) (Elroy-Stein et al. (1989) PNAS USA 86:6126-6130); potyvirus leaders, for example, TEV leader (Tobacco Etch Virus) (Allison et al. (1986); MDMV leader (Maize Dwarf Mosaic Virus); Virology 154:9-20), and human immunoglobulin heavy-chain binding protein (BiP), (Macejak et al. (1991) Nature 353:90-94); untranslated leader from the coat protein mRNA of alfalfa mosaic virus (AMV RNA 4) (Jobling et al. (1987) Nature 325:622-625); tobacco mosaic virus leader (TMV) (Gallie et al. (1989) in Molecular Biology of RNA, ed. Cech (Liss, N.Y.), pp. 237-256); and maize chlorotic mottle virus leader (MCMV) (Lommel et al. (1991) Virology 81:382-385). See also, Della-Cioppa et al. (1987) Plant Physiol. 84:965-968. Other methods known to enhance translation can also be utilized, for example, introns, and the like.

[0081] In preparing the expression cassette, the various DNA fragments may be manipulated, so as to provide for the DNA sequences in the proper orientation and, as appropriate, in the proper reading frame. Toward this end, adapters or linkers may be employed to join the DNA fragments or other manipulations may be involved to provide for convenient restriction sites, removal of superfluous DNA, removal of restriction sites, or the like. For this purpose, in vitro mutagenesis, primer repair, restriction, annealing, resubstitutions, e.g. transitions and transversions may be involved.

[0082] The RGH sequences of the present invention can be used to transform any plant. In this manner, genetically modified plants, plant cells, plant tissue, seed, and the like can be obtained. Transformation protocols may vary depending on the type of plant or plant cell, i.e., monocot or dicot, targeted for transformation. Suitable methods of introducing nucleotide sequences into plant cells and subsequent insertion into the plant genome include microinjection (Crossway et al. (1986) Biotechniques 4:320-334), electroporation (Riggs et al. (1986) Proc. Natl. Acad. Sci. USA 83:5602-5606, Agrobacterium-mediated transformation (Townsend et al., U.S. Pat. No. 5,563,055), direct gene transfer (Paszkowski et al. (1984) EMBO J. 3:2717-2722), and ballistic particle acceleration (see, for example, Sanford et al., U.S. Pat. No. 4,945,050; Tomes et al. (1995) “Direct DNA Transfer into Intact Plant Cells via Microprojectile Bombardment,” in Plant Cell, Tissue, and Organ Culture: Fundamental Methods, ed. Gamborg and Phillips (Springer-Verlag, Berlin); and McCabe et al. (1988) Biotechnology 6:923-926). Also see Weissinger et al. (1988) Ann. Rev. Genet. 22:421-477; Sanford et al. (1987) Particulate Science and Technology 5:27-37 (onion); Christou et al. (1988) Plant Physiol. 87:671-674 (soybean); McCabe et al. (1988) Bio/Technology 6:923-926 (soybean); Finer and McMullen (1991) In Vitro Cell Dev. Biol. 27P:175-182 (soybean); Singh et al. (1998) Theor. Appl. Genet. 96:319-324 (soybean); Datta et al. (1990) Biotechnology 8:736-740 (rice); Klein et al. (1988) Proc. Nat. Acad. Sci. USA 85:4305-4309 (maize); Klein et al. (1988) Biotechnology 6:559-563 (maize); Tomes, U.S. Pat. No. 5,240,855; Buising et al., U.S. Pat. Nos. 5,322,783 and 5,324,646; Tomes et al. (1995) “Direct DNA Transfer into Intact Plant Cells via Microprojectile Bombardment,” in Plant Cell, Tissue, and Organ Culture: Fundamental Methods, ed. Gamborg (Springer-Verlag, Berlin) (maize); Klein et al. (1988) Plant Physiol. 91:440-444 (maize); Fromm et al. (1990) Biotechnology 8:833-839 (maize); Hooykaas-Van Slogteren et al. (1984) Nature (London) 311:763-764; Bowen et al., U.S. Pat. No. 5,736,369 (cereals); Bytebier et al. (1987) Proc. Natl. Acad. Sci. USA 84:5345-5349 (Liliaceae); De Wet et al. (1985) in The Experimental Manipulation of Ovule Tissues, ed. Chapman et al. (Longman, N.Y.), pp. 197-209 (pollen); Kaeppler et al. (1990) Plant Cell Reports 9:415-418 and Kaeppler et al. (1992) Theor. Appl. Genet. 84:560-566 (whisker-mediated transformation); D'Halluin et al. (1992) Plant Cell 4:1495-1505 (electroporation); Li et al. (1993) Plant Cell Reports 12:250-255 and Christou and Ford (1995) Annals of Botany 75:407-413 (rice); Osjoda et al. (1996) Nature Biotechnology 14:745-750 (maize via Agrobacterium tumefaciens); all of which are herein incorporated by reference.

[0083] The cells that have been transformed may be grown into plants in accordance with conventional ways. See, for example, McCormick et al. (1986) Plant Cell Reports 5:81-84. These plants may then be grown, and either pollinated with the same transformed strain or different strains, and the resulting hybrid having the desired phenotypic characteristic identified. Two or more generations may be grown to ensure that the subject phenotypic characteristic is stably maintained and inherited and then seeds harvested to ensure the desired phenotype or other property has been achieved.

[0084] The methods of the invention can be used with other methods available in the art for enhancing disease resistance in plants.

[0085] The following examples are offered by way of illustration and not by way of limitation.

Experimental

[0086] In the past few years, many R genes have been cloned from various plant species. Sequence analysis has shown that some conserved structural features are common among cloned R genes which confer resistance to bacterial, fungal, viral, and nematode pathogens (Staskawicz et al. (1995) Science 268:661-667; Bent (1996) Plant Cell 8:1757-1771; Baker et al. (1997) Science 276:726-733). As predicted by the gene-for-gene theory, the R gene products contain various important domains for interacting with pathogen elicitors (see review by Bent (1996) Plant Cell 8:1757-1771). Leucine-rich repeats (LRRs), which are the least conserved domains among R genes, are believed to be responsible for protein-protein interaction and thus might be involved in pathogen recognition and specificity. Recently, studies of two groups strongly support such a role for LRRs (Anderson et al. (1997) Plant Cell 9:641-651; Ori et al. (1997) Plant Cell 9:521-532). The presence of three conserved subdomains in nucleotide-binding site (NBS) domains, i.e., P-loop, kinase-2, and kinase-3a, suggests that ATG/GTP binding is essential for the function of some R gene products. LRRs and NBS are also thought to be involved in signaling at some level in the disease resistance signaling pathway (Baker et al. (1997) Science 276:726-733). A domain having some similarity to the cytoplasmic signaling domain of Toll/interleukin-1 (TIR) receptors is also present in some R gene products. Leucine zippers (LZ), which have a role in homo- and heterodimerization of eukaryotic transcription factors, are also found some R gene products. Serine/threonine kinase is part of some R gene products, suggesting an involvement of these proteins in the activation of resistance-related genes in signaling pathways.

[0087] To date, cloned R genes could be classified into several groups based on these conserved structural domains in their gene products. In the NBS-LRR superfamily, tobacco N, flax L6 and M, and Arabidopsis RPP5 have TIR on the amino-terminus of their gene products (Whitham et al. (1994) Cell 78:1101-1115; Lawrence et al. (1995) Plant Cell 7:1195-1206; Anderson et al. (1997) Plant Cell 19:641-651; Parker et al. (1997) Plant Cell 9:879-894), and Arabidopsis RPM1 and RPS2 and tomato Prf have LZ on the amino-terminus (Bent et al. (1994) Plant Cell 8:1757-1771; Mindrinos et al. (1994) Cell 78:1089-1099; Grant et al. (1995) Science 269:843-846; Salmeron et al. (1996) Cell 86:123-133). Tomato Cf-9 and Cf-2 and sugar beet HS1^(pro-1) belong to a group of R genes that have LRRs and a transmembrane domain (Jones et al. (1994) Science 266:789-793; Dixon et al. (1996) Cell 84:451-459; Cai et al. (1997) Science 275:832-834). Tomato Pto is a kinase-encoding R gene (Martin et al. (1993) Science 262:1432-1436), and wheat Lr10 has a transmembrane domain in addition to its kinase domain (Feuillet et al. (1997) Plant J. 11:45-52). Rice Xa21 is the only R gene encoding a LRR receptor kinase with a transmembrane region between LRRs and the kinase domain (Song et al. (1995) Science 270:1804-1806).

[0088] Recently, some groups have successfully isolated R gene candidates using PCR with primers designed based on highly conserved motifs among cloned R genes (Kanazin et al. (1996) Proc. Natl. Acad. Sci. USA 93:11746-11750; Leister et al. (1996) Nature Gen. 14:421-429, Yu et al. (1996) Proc. Natl. Acad. Sci. USA 93:11751-11756). Most of the PCR clones isolated using this approach contain some other conserved motifs besides those contributed by the primers. Some of the PCR clones have also been mapped near to and are potentially linked to known R genes. These results suggest that sufficient sequence differences exist between R genes and other sequences, and therefore, the PCR approach is useful for isolating R gene candidates. In addition, a gene encoding a new receptor-like kinase has been successfully cloned in wheat using a homologue probe of serine/threonine kinase genes (Feuillet et al. (1997) Plant J. 11:45-52). Therefore, isolating R gene candidates based on conserved motifs of cloned R genes is a practical approach among wide plant taxa.

[0089] In Example 1, a PCR approach was used to isolate resistance gene homologues (RGHs) from maize, sorghum, and rice. Thirteen RGH families were isolated and genetically mapped to the corresponding plant genomes. The corresponding gene of one RGH has also been isolated and shown to be a member of the LZ-NBS-LRR family. Example 2 demonstrates use of the RGH sequences for transformation of a plant to enhance disease resistance.

EXAMPLE 1 PCR Amplification, Cloning, and Sequence Analysis of RGHs

[0090] Degenerate primers LM638 and LM637, which were designed from the conserved P-loop and the putative transmembrane sequences (Kanazin et al. (1996) Proc. Natl. Acad. Sci. USA 93:11746-11750), respectively, were used for amplification of RGHs by PCR. Source DNA for RGH amplification included maize genomic DNA (Q66), maize total cDNA prepared from root (Lhad2), leaf (Lhad2), and two-leaf seedling (corn B73 cDNA library, Clontech Laboratories, Inc.); sorghum genomic DNA (BT×623); and rice genomic DNA (TQ). A 100-μl PCR cocktail containing 80 ng of source DNA, 20 pmol each of the primers, 5 units of Taq polymerase (Promega), 1×Taq polymerase reaction buffer (Promega), 2.5 mM MgCl₂, and 0.2 mM each of dATP, dTTP, dGTP, and dCTP was subjected to 35 cycles of PCR amplification as described in Kanazin et al. (1996) Proc. Natl. Acad. Sci. USA 93:11746-11750). To enrich the variability of RGH PCR products, the concentration of MgCl₂ was varied from to 2.5 to 6.0 mM with or without the addition of DMSO to a final concentration of 5%. Amplification products were cloned into pBluescript KS (Stratagene) or pGEM-T Easy (Promega) vector and sequenced using the Pharmacia Biotech model ALF Express automated sequencer. DNA sequences were analyzed using GCG (University of Wisconsin Genetics Computer Group, Madison) sequence analysis packages. Alignments of amino acid sequences were carried out using the PILEUP function, and phylogenetic analysis was done using the DISTANCES and GROWTREE functions.

[0091] BAC Library Screening and Sequence Analysis of an RGH

[0092] Two maize RGH clones, M5-1 and M6-1, were used to screen a sorghum BAC library (BT×623) and two rice BAC libraries (Teqing and Lemont) by the techniques described before (Woo et al. (1994) Nucleic Acids Res. 22:4922-4931). One RGH was subcloned from a sorghum BAC clone selected with M6-1 into a pBluescript KS vector with a Bam HI+Sal I double digestion and sequenced using the transposon-facilitated sequencing strategy (Strathmann et al. (1991) Proc. Natl. Acad. Sci. USA 88:1247-1250).

[0093] Cloning and Sequence Analysis of RGHs Isolated from Maize, Sorghum, and Rice

[0094] The results of PCR amplification, cloning, and classification are summarized in Table 1. By using primers LM638 and LM637 under varied conditions, several PCR products showing sequence homology to cloned R genes were identified in maize, sorghum, and rice. PCR products of 0.5, 0.6, and 1.1 kb were amplified from maize genomic DNA. The 0.5 and 0.6-kb maize PCR products were cloned and analyzed. TABLE 1 PCR Amplification, Cloning, and Families of Maize, Sorghum, and Rice RGHs. Source DNA PCR products RGH families^(a) Maize Genomic DNA 0.5 kb M5-1 0.6 kb M6-1 1.1 kb N.A.^(b) cDNA from: root 0.5 kb M5-1, M5-6 seedling 0.7 kb —^(c) leaf 0.5 kb M5-1, M5-6 0.7 kb N.A.^(b) Sorghum genomic DNA 0.5 kb S5-2A, S5-2B, S5-5 0.6 kb S6-1, S6-2 0.8 kb —^(c) 1.1 kb S11-1 Rice genomic DNA 0.5 kb R5-1, R5-2, R5-3, R5-4

[0095] Two families of maize RGHs, M5-1 (SEQ ID NO: 1) and M6-1 (SEQ ID NO: 3), were identified in these PCR products based on the results of cross-hybridization under high stringency conditions (0.1×SSC, 0.2% SDS) and sequencing. The predicted partial-length proteins encoded by these maize RGHs are set forth in SEQ ID NOS: 2 and 5, respectively. PCR products of 0.5 and/or 0.7 kb were amplified from maize cDNA prepared from root, seedling, and leaf, and M5-6 (SEQ ID NO: 6) was identified in the 0.5-kb PCR products in addition to M5-1. The predicted partial-length protein encoded by M5-6 is set forth in SEQ ID NO: 7.

[0096] A heterogeneous 0.5-kb PCR product was amplified from rice genomic DNA under regular PCR condition without DMSO and four rice RGH families (R5-1, SEQ ID NO: 8; R5-2, SEQ ID NO: 10; R5-3, SEQ ID NO: 12; and R5-4, SEQ ID NO: 14) were identified. The predicted partial-length proteins encoded by these rice RGHs are set forth in SEQ ID NOS: 9, 11, 13, and 15, respectively.

[0097] PCR products of 0.5, 0.6, 0.8, and 1.1 kb were amplified from sorghum genomic DNA. Six sorghum RGH families (S5-5, SEQ ID NO: 16; S5-2A, SEQ ID NO: 18; S5-2B, SEQ ID NO: 20; S6-1, SEQ ID NO: 21; S6-2, SEQ ID NO: 24; and S11, having four members referred to as S11-1 (SEQ ID NO: 27), S11-25 (SEQ ID NO: 30), S11-27 (SEQ ID NO: 31), and S11-34 (SEQ ID NO: 32) were identified in these PCR products except the 0.8-kb product. The predicted partial-length proteins encoded by S5-5, S5-2A, S6-1, S6-2, and S11-1 are set forth in SEQ ID NOS: 17, 19, 23, 26, and 29, respectively.

[0098] At least one clone from each RGH family was sequenced. The deduced amino acid sequences were highly conserved and showed striking homology to cloned R genes, particularly to Arabidopsis RPM1 and RPS2 and tomato Prf (FIGS. 1A-1B). Despite the size difference, all the RGH families identified had highly conserved kinase-2 and kinase-3a domains shared by R genes in the NBS-LRR superfamily in addition to P-loop and the putative transmembrane domain that were contributed by the primers. However, RGH families S6-1, S6-2 and S11-1 cannot be translated into polypeptides uninterrupted by stop codons and frameshifts were found in the sequences. When the sequences near the frameshifts were checked carefully, a putative intron was found within the kinase-2 domain coding sequences (data not shown; FIG. 2). Although RGH M6-1 could be translated into a polypeptide without interruption by stop codons, a putative intron was also found within the kinase-2 domain coding region. The size of the putative intron is 126 bp in M6-1 (nucleotides (nt) 211-336 of SEQ ID NO: 3), 92 bp in S6-1 (nt 220-311 of SEQ ID NO: 21), 100 bp in S6-2 (nt 229-328 of SEQ ID NO: 24), and 518 bp in S11-1 (nt 225-742 of SEQ ID NO: 27) (data not shown). Splicing of the putative introns results in coding sequences (SEQ ID NOS: 4, 22, 25, and 28, respectively) that are translated into the predicted partial-length proteins set forth in SEQ ID NOS: 5, 23, 26, and 29, respectively. This splicing gives these RGHs a similar size to cloned R genes and their homologues on the region between P-loop and the putative transmembrane domain (data not shown), as well as aligns the deduced amino acid sequences on kinase-2 domains of these RGHs and cloned R genes perfectly (FIG. 2).

[0099] Due to the usage of the degenerate primers in PCR amplification of RGHs and to the presence of a putative intron in some RGH clones, an alignment of amino acid sequences of all RGHs was done starting from the third amino acid of kinase-2 domain (FIGS. 1A-1B). The corresponding regions of Arabidopsis RMP1 and RPS2 and tomato Prf, tobacco N, and flax L6 were also included on the alignment for comparison. Based on this alignment, neighbor-joining method in DISTANCES function was used for analysis of sequence differences and a phylogenetic tree was constructed using the GROWTREE function (data not shown).

[0100] Genetic Mapping

[0101] M6-1 was mapped to maize chromosome bin 3.04, a region where several known R genes cluster, and is very close to Wsm2.

[0102] BAC Screening and Sequence Analysis of RGH S6-1

[0103] RGHs M5-1 (SEQ ID NO: 1) and M6-1 (SEQ ID NO: 3) were used as probes to screen sorghum (BT×623) BAC library and two rice (Lemont and Teqing) BAC libraries. Two rice Lemont BACs and three sorghum BACs were identified with M5-1 and M6-1, respectively (data not shown). Copy number of each RGH sequence was determined by digesting the BACs with Hae III, which does not have recognition sites within M5-1 and M6-1 sequences and then by hybridizing with individual RGH probes. One or two copies of M5-1 and one copy of M6-1 were contained in the BACs (data not shown).

[0104] Since M6-1 was mapped near to and is probably linked to Wsm2, subcloning and sequencing of its corresponding gene in one of the BACs were further carried out. A 10-kb BAM HI−Sal I sorghum BAC fragment containing M6-1 sequence was subcloned and sequenced approximately 7.2 kb from the Bam HI end. Sequence analysis indicated that, if not including the P-loop and the putative transmembrane domain contributed by the degenerate PCR primers, the corresponding region for PCR amplification of all RGHs in this sorghum BAC had a DNA sequence similarity of 99.7% and an amino acid sequence similarity of 100% to the PCR RGH family S6-1 (SEQ ID NO: 21) (data not shown). The same 92-bp putative intron found in the kinase-2 domain coding sequence of the PCR RGH family S6-1 was also present in the corresponding region in the BAC subclone (FIG. 3). This putative gene is apparently the corresponding gene or a very close homologue of S6-1 and hereafter referred to as the S6-1 gene. The S6-1 gene (residing within nt 3376-6329 of the BAC clone sequence set forth in SEQ ID NO: 33) is set forth in SEQ ID NO: 34. Removal of the putative 92-bp intron (nt 822-913 of SEQ ID NO: 34) results in a resistance gene having an open-reading frame of 2859 bp (see SEQ ID NO: 35). This 2859-bp region could be translated into a polypeptide of 953 amino acids (SEQ ID NO: 36) without interruption by stop codons (FIG. 3). No apparent continuous coding region could be found in the upstream 3.3-kb region of the putative initiation codon (nt 3376-3378 of SEQ ID NO: 33) or in the downstream 0.8-kb region of the putative stop codon (nt 6327-6329 of SEQ ID NO: 33) (data not shown). The deduced amino acid sequence of the 2859-bp coding region had each of the highly conserved domains present in Arabidopsis RPM1 and RPS2 and tomato Prf, i.e., LZ, P-loop, kinase-2, kinase-3a, a putative transmembrane domain, and LRRs (FIG. 3). In addition, an amino acid sequence alignment of S6-1, Arabidopsis RPM1 and RPS2, and tomato Prf further revealed two other conserved domains of unknown function X and Y. The deduced amino acid sequence (SEQ ID NO: 36) encoded by the S6-1 gene is very similar to that of Arabidopsis RPM1 with a similarity of 67% on LZ, 84% on the overall NBS region, 91% on P-loop, 100% on kinase-2, 85% on kinase-3a, 85% on the putative transmembrane domain, 86% on domain X, 100% on domain Y, and 66% on LRRs (FIG. 3).

[0105] Discussion

[0106] By using a PCR approach, 13 RGH families have been isolated from maize, sorghum, and rice. Although six to eleven RGH families were isolated from dicots under standard PCR conditions (Kanazin et al. (1996) Proc. Natl. Acad. Sci. USA 93:11746-11750; Leister et al. (1996) Nature Genetics 14:421-429; Yu et al. (1996) Proc. Natl. Acad. Sci. USA 93:11751 -11756), not as many RGH families were identified in maize, sorghum, and rice using a same PCR approach even though various PCR conditions have been tried. Only three RGH families were isolated from different maize sources of DNA under various PCR conditions (M5-1, M6-1, and M5-6), six families from sorghum genomic DNA under various PCR conditions (S5-5, S5-2A, S5-2B, S6-1, S6-2, and S11 (of which S11-1, S 11-25, S 11-27, and S 11-34 are members), and four families from rice genomic DNA under regular PCR conditions (R5-1, R5-2, R5-3, and R5-4) (Table 1). These results suggest that there might not be as many RGH families that belong to the NBS-LRR superfamily in these crops as in soybean and potato (Kanazin et al. (1996) Proc. Natl. Acad. Sci. USA 93:11746-11750; Leister et al. (1996) Nature Genetics 14:421-429; Yu et al. (1996) Proc. Natl. Acad. Sci. USA 93:11746-11750). Alternatively, and most likely, sequences coding for P-loop and the putative transmembrane domains of most RGHs might have diverged or even been lost in these monocots. Since the two degenerate primers used in the PCR amplification were designed from the conserved motifs in R genes cloned from dicots, they might not be able to amplify many of the RGHs in monocots. A phylogenetic tree based on an alignment of the deduced amino acid sequences of several cloned R genes and RGHs isolated from maize, sorghum, and rice also revealed a closer relationship of most RGHs of the same size to each other than to the R genes cloned from dicots (data not shown).

[0107] Interestingly, unlike all the cloned R genes in the NBS-LRR superfamily and the majority of their homologues isolated from soybean, potato, and rice (Table 1) having an approximately 0.5-kb continuous coding sequence in the region between the P-loop and the putative transmembrane domain, four RGH families of 0.6 kb or 1.1 kb (M6-1, S6-1, S6-2, S11-1) could not be translated into polypeptides without interruption by stop codons. A putative intron of 92-518 bp was found within the kinase-2 domain coding sequences of these RGHs. Removal of these putative introns would make the deduced amino acid sequences of these RGHs align perfectly with the R genes and RGHs in the same superfamily on kinase-2 domain (FIG. 2), as well as give a similar size on the corresponding regions between the P-loop and the putative transmembrane domain (data not shown). Analysis of the corresponding cDNA of these RGHs will reveal whether or not these introns are real. Some cloned R genes have been shown to have introns; however, no intron within a kinase-2 domain has been reported before.

[0108] Of the 13 RGH families isolated, three families were mapped near to known R genes. M6-1 and S6-1 were mapped to maize chromosomal bin 3.04. Several known R genes and quantitative trait loci (QTL) are mapped on this region: Rp3 (common rust), Mv1 (maize mosaic virus), Wsm2 (wheat streak mosaic virus), a QTL associated with resistance to European corn borer, and a QTL associated with resistance to fusarium stalk rot (McMullen and Simcox (1995) Microbe Interactions 6:811-815). Preliminary mapping results suggest that M6-1 is closer to Wsm2 than to Rp3, but how well M6-1 co-segregates with Wsm2 is not known at this point. M5-6 was mapped to maize chromosomal bin 7.04 where another European corn borer QTL is located. However, further information about whether or not M5-6 co-segregates with this QTL is still needed. Most of the RGHs isolated were not mapped to locations where known R genes or QTLs mapped. This is largely due to the lack of phenotypic data of disease reaction. Particularly, there is not yet a good mapping system of sorghum R genes, and not many R genes and useful markers have been mapped on the sorghum genome.

[0109] To confirm the identity of RGH M6-1 as an R gene candidate and to learn more about its sequence features, subcloning and sequencing of the corresponding gene in one of the BACs were further carried out. Sequencing results indicated that the corresponding gene of M6-1 in the sorghum BAC was indeed the S6-1 gene. The same putative 92-bp intron found in the PCR RGH family S6-1 (SEQ ID NO: 21) was also present on the same region in the S6-1 gene (FIG. 3). The sequence of S6-1 (SEQ ID NO: 34) obtained from the sorghum BAC subclone (SEQ ID NO: 33) showed that this putative 2954-bp R gene candidate could encode a polypeptide of 953 amino acids (SEQ ID NO: 36) with interruption by the putative 92-bp intron, and its deduced amino acid sequence had all the conserved domains shared by members of the LZ-NBS-LRR R gene family (FIG. 3). Similarity of S6-1 to Arabidopsis RPM1 on the conserved domains was particularly high (66% to 100%) (FIG. 3). The size of this putative gene and the position of all the conserved domains also matched that of Arabidopsis RPM1 and RPS2 perfectly. However, whether the region obtained and sequenced is a full S6-1 gene or whether there are other coding regions in its upstream and/or downstream positions is not known at this point due to the lack of analysis on the corresponding cDNA.

[0110] This study, along with the results obtained by other groups (Kanazin et al. 1996; Leister et al. 1996; Yu et al. 1996; Feuillet et al. 1997), indicates that homology-based isolation of R gene candidates could be beneficial to cloning or positioning real R genes conferring resistance to bacterial, fungal, viral, and nematode pathogens. However, whether or not these RGHs are functional R genes conferring resistance to any known or unknown pathogens of maize, sorghum, or rice is not known. Even for RGHs mapped near to known R genes, cDNA analysis and detailed co-segregation tests are still needed, and further DNA complementation transformation testing is also necessary to confirm their role in disease resistance.

EXAMPLE 2 Transformation and Regeneration of Transgenic Plants

[0111] Immature maize embryos from greenhouse donor plants are bombarded with a plasmid containing one of the RGH sequences of the invention operably linked to the ubiquitin (UBI) promoter (FIG. 4) plus a plasmid containing the selectable marker gene PAT (Wohlleben et al. (1988) Gene 70:25-37) that confers resistance to the herbicide Bialaphos. Transformation is performed as follows. All media recipes are in the Appendix.

[0112] Preparation of Target Tissue

[0113] The ears are surface sterilized in 30% Chlorox bleach plus 0.5% Micro detergent for 20 minutes, and rinsed two times with sterile water. The immature embryos are excised and placed embryo axis side down (scutellum side up), 25 embryos per plate, on 560Y medium for 4 hours and then aligned within the 2.5-cm target zone in preparation for bombardment.

[0114] Preparation of DNA

[0115] A plasmid vector comprising one of the RGH seuqences of the invention operably linked to the ubiquitin promoter is made. This plasmid DNA plus plasmid DNA containing a PAT selectable marker is precipitated onto 1.1 μm (average diameter) tungsten pellets using a CaCl₂ precipitation procedure as follows:

[0116] 100 μl prepared tungsten particles in water

[0117] 10 μl (1 μg) DNA in TrisEDTA buffer (1 μg total)

[0118] 100 μl 2.5MCaCl₂

[0119] 10 μl 0.1 M spermidine

[0120] Each reagent is added sequentially to the tungsten particle suspension, while maintained on the multitube vortexer. The final mixture is sonicated briefly and allowed to incubate under constant vortexing for 10 minutes. After the precipitation period, the tubes are centrifuged briefly, liquid removed, washed with 500 ml 100% ethanol, and centrifuged for 30 seconds. Again the liquid is removed, and 105 μl 100% ethanol is added to the final tungsten particle pellet. For particle gun bombardment, the tungsten/DNA particles are briefly sonicated and 10 μl spotted onto the center of each macrocarrier and allowed to dry about 2 minutes before bombardment.

[0121] Particle Gun Treatment

[0122] The sample plates are bombarded at level #4 in particle gun #HE34-1 or #HE34-2. All samples receive a single shot at 650 PSI, with a total of ten aliquots taken from each tube of prepared particles/DNA.

[0123] Subsequent Treatment

[0124] Following bombardment, the embryos are kept on 560Y medium for 2 days, then transferred to 560R selection medium containing 3 mg/liter Bialaphos, and subcultured every 2 weeks. After approximately 10 weeks of selection, selection-resistant callus clones are transferred to 288J medium to initiate plant regeneration. Following somatic embryo maturation (2-4 weeks), well-developed somatic embryos are transferred to medium for germination and transferred to the lighted culture room. Approximately 7-10 days later, developing plantlets are transferred to 272V hormone-free medium in tubes for 7-10 days until plantlets are well established. Plants are then transferred to inserts in flats (equivalent to 2.5″ pot) containing potting soil and grown for 1 week in a growth chamber, subsequently grown an additional 1-2 weeks in the greenhouse, then transferred to classic 600 pots (1.6 gallon) and grown to maturity. Plants are monitored and scored for disease resistance. APPENDIX INGREDIENT AMOUNT UNIT 272 V D-I H₂O 950.000 Ml MS Salts (GIBCO 11117-074) 4.300 G Myo-Inositol 0.100 G MS Vitamins Stock Solution ## 5.000 Ml Sucrose 40.000 G Bacto-Agar @ 6.000 G Directions: @ = Add after bringing up to volume Dissolve ingredients in polished D-I H₂O in sequence Adjust to pH 5.6 Bring up to volume with polished D-I H₂O after adjusting pH Sterilize and cool to 60° C. ## = Dissolve 0.100 g of Nicotinic Acid; 0.020 g of Thiamine.HCL; 0.100 g of Pyridoxine.HCL; and 0.400 g of Glycine in 875.00 ml of polished D-I H₂O in sequence. Bring up to volume with polished D-I H₂O. Make in 400 ml portions. Thiamine.HCL & Pyridoxine.HCL are in Dark Desiccator. Store for one month, unless contamination or precipitation occurs, then make fresh stock. Total Volume (L) = 1.00 288 J D-I H₂O 950.000 Ml MS Salts 4.300 g Myo-Inositol 0.100 g MS Vitamins Stock Solution ## 5.000 ml Zeatin .5 mg/ml 1.000 ml Sucrose 60.000 g Gelrite @ 3.000 g Indoleacetic Acid 0.5 mg/ml # 2.000 ml 0.1 mM Abscisic Acid 1.000 ml Bialaphos 1 mg/ml # 3.000 ml Directions: @ = Add after bringing up to volume Dissolve ingredients in polished D-I H₂O in sequence Adjust to pH 5.6 Bring up to volume with polished D-I H₂O after adjusting pH Sterilize and cool to 60° C. Add 3.5 g/L of Gelrite for cell biology. ## = Dissolve 0.100 g of Nicotinic Acid; 0.020 g of Thiamine.HCL; 0.100 g of Pyridoxine.HCL; and 0.400 g of Glycine in 875.00 ml of polished D-I H₂O in sequence. Bring up to volume with polished D-I H₂O. Make in 400 ml portions. Thiamine.HCL & Pyridoxine.HCL are in Dark Desiccator. Store for one month, unless contamination or precipitation occurs, then make fresh stock. Total Volume (L) 1.00 560 R D-I Water, Filtered 950.000 ml CHU (N6) Basal Salts (SIGMA C-1416) 4.000 g Eriksson's Vitamin Mix (1000X SIGMA-1511 1.000 ml Thiamine.HCL 0.4 mg/ml 1.250 ml Sucrose 30.000 g 2,4-D 0.5 mg/ml 4.000 ml Gelrite @ 3.000 g Silver Nitrate 2 mg/ml # 0.425 ml Bialaphos 1 mg/ml # 3.000 ml Directions: @ = Add after bringing up to volume # = Add after sterilizing and cooling to temp. Dissolve ingredients in D-I H₂O in sequence Adjust to pH 5.8 with KOH Bring up to volume with D-I H₂O Sterilize and cool to room temp. Total Volume (L) = 1.00 560 Y D-I Water, Filtered 950.000 ml CHU (N6) Basal Salts (SIGMA C-1416) 4.000 g Eriksson's Vitamin Mix (1000X SIGMA-1511 1.000 ml Thiamine.HCL 0.4 mg/ml 1.250 ml Sucrose 120.000 g 2,4-D 0.5 mg/ml 2.000 ml L-Proline 2.880 g Gelrite @ 2.000 g Silver Nitrate 2 mg/ml # 4.250 ml Directions: @ = Add after bringing up to volume # = Add after sterilizing and cooling to temp. Dissolve ingredients in D-I H₂O in sequence Adjust to pH 5.8 with KOH Bring up to volume with D-I H₂O Sterilize and cool to room temp. ** Autoclave less time because of increased sucrose** Total Volume (L) = 1.00

[0125] All publications and patent applications mentioned in the specification are indicative of the level of those skilled in the art to which this invention pertains. All publications and patent applications are herein incorporated by reference to the same extent as if each individual publication or patent application was specifically and individually indicated to be incorporated by reference.

[0126] Although the foregoing invention has been described in some detail by way of illustration and example for purposes of clarity of understanding, it will be obvious that certain changes and modifications may be practiced within the scope of the appended claims.

1 54 1 517 DNA Zea mays CDS (2)...(517) 1 t ggg ggg gtg ggg aag acg act cta aca cag ctc atc tac aat gat gaa 49 Gly Gly Val Gly Lys Thr Thr Leu Thr Gln Leu Ile Tyr Asn Asp Glu 1 5 10 15 aga gta aag gag cat ttc cag tta agg gtg tgg ttg tgc gtt tct gaa 97 Arg Val Lys Glu His Phe Gln Leu Arg Val Trp Leu Cys Val Ser Glu 20 25 30 aat ttt gac gag atg aag ctt acc aag gaa aca att gaa tca gtt gct 145 Asn Phe Asp Glu Met Lys Leu Thr Lys Glu Thr Ile Glu Ser Val Ala 35 40 45 agt gga ttc tca tca gcc aca aca aac atg aac ctg ctc caa gaa gac 193 Ser Gly Phe Ser Ser Ala Thr Thr Asn Met Asn Leu Leu Gln Glu Asp 50 55 60 ctc tca aaa aag ctg caa ggt aaa cgg ttt ctt cta gtc ctt gat gat 241 Leu Ser Lys Lys Leu Gln Gly Lys Arg Phe Leu Leu Val Leu Asp Asp 65 70 75 80 gta tgg aat gag gat cct gaa aaa tgg gac aga tat cgt tgt gct cta 289 Val Trp Asn Glu Asp Pro Glu Lys Trp Asp Arg Tyr Arg Cys Ala Leu 85 90 95 ctt agc ggg gga aag gga agc agg att ata att acc acg cga aac aaa 337 Leu Ser Gly Gly Lys Gly Ser Arg Ile Ile Ile Thr Thr Arg Asn Lys 100 105 110 aat gtg ggg ata cta atg ggt ggg atg act cct tac cat cta aag cag 385 Asn Val Gly Ile Leu Met Gly Gly Met Thr Pro Tyr His Leu Lys Gln 115 120 125 cta tca aac gat gat tgc tgg cag ttg ttc aaa aaa cat gca ttt gta 433 Leu Ser Asn Asp Asp Cys Trp Gln Leu Phe Lys Lys His Ala Phe Val 130 135 140 gat ggt gac tcc agt tca cac cca gaa tta gaa ata ata ggc aag gac 481 Asp Gly Asp Ser Ser Ser His Pro Glu Leu Glu Ile Ile Gly Lys Asp 145 150 155 160 atc gtg aag aag ttg aaa ggc ctc ccc yta gcc cta 517 Ile Val Lys Lys Leu Lys Gly Leu Pro Xaa Ala Leu 165 170 2 172 PRT Zea mays VARIANT (1)...(172) Xaa = Any Amino Acid 2 Gly Gly Val Gly Lys Thr Thr Leu Thr Gln Leu Ile Tyr Asn Asp Glu 1 5 10 15 Arg Val Lys Glu His Phe Gln Leu Arg Val Trp Leu Cys Val Ser Glu 20 25 30 Asn Phe Asp Glu Met Lys Leu Thr Lys Glu Thr Ile Glu Ser Val Ala 35 40 45 Ser Gly Phe Ser Ser Ala Thr Thr Asn Met Asn Leu Leu Gln Glu Asp 50 55 60 Leu Ser Lys Lys Leu Gln Gly Lys Arg Phe Leu Leu Val Leu Asp Asp 65 70 75 80 Val Trp Asn Glu Asp Pro Glu Lys Trp Asp Arg Tyr Arg Cys Ala Leu 85 90 95 Leu Ser Gly Gly Lys Gly Ser Arg Ile Ile Ile Thr Thr Arg Asn Lys 100 105 110 Asn Val Gly Ile Leu Met Gly Gly Met Thr Pro Tyr His Leu Lys Gln 115 120 125 Leu Ser Asn Asp Asp Cys Trp Gln Leu Phe Lys Lys His Ala Phe Val 130 135 140 Asp Gly Asp Ser Ser Ser His Pro Glu Leu Glu Ile Ile Gly Lys Asp 145 150 155 160 Ile Val Lys Lys Leu Lys Gly Leu Pro Xaa Ala Leu 165 170 3 634 DNA Zea mays tgggggggtg gggaagacga ctcttgccag tgaagtgtac cggaggstcg aagcgcaatt 60 tgactacaga gctttcgtat cagtgtcaca gaaccctgac atgaagaaga tattgaggca 120 tatactctgc caccgagggt gtggcggcag ccaggaatgg gatgagcagc aactcatcca 180 cgccgtaaga gagttcctcc aggataagag gtatgcatgc atcgtatatc tgaattcttc 240 tcatggcaat cttgccttga aggaatatta tctctctgca gcttcaactc gtcagcaggg 300 acatgctttg attaaatttg tgcaaattcg tgctaggtac tttgttgtca ttgacgatat 360 atggagcaca tcagcatgga ggattatcag atgtgctttt cctgaaaaca actgttccag 420 taggatactg acaactactc gcatcgtcac agttgctaag tactgttgct cacctcacct 480 tgaccgtgtg tatgcactgg agcctctcga tgcagctcac tctgagagct tatttttaag 540 taggattttt ggttccgaag atagatgtcc tctccatctg aaagaagctt ccgacaaaat 600 actgaaaaga tgtggtggcc tccccctagc ctta 634 4 508 DNA Zea mays CDS (101)...(508) 4 tgggggggtg gggaagacga ctcttgccag tgaagtgtac cggaggstcg aagcgcaatt 60 tgactacaga gctttcgtat cagtgtcaca gaaccctgac atg aag aag ata ttg 115 Met Lys Lys Ile Leu 1 5 agg cat ata ctc tgc cac cga ggg tgt ggc ggc agc cag gaa tgg gat 163 Arg His Ile Leu Cys His Arg Gly Cys Gly Gly Ser Gln Glu Trp Asp 10 15 20 gag cag caa ctc atc cac gcc gta aga gag ttc ctc cag gat aag agg 211 Glu Gln Gln Leu Ile His Ala Val Arg Glu Phe Leu Gln Asp Lys Arg 25 30 35 tac ttt gtt gtc att gac gat ata tgg agc aca tca gca tgg agg att 259 Tyr Phe Val Val Ile Asp Asp Ile Trp Ser Thr Ser Ala Trp Arg Ile 40 45 50 atc aga tgt gct ttt cct gaa aac aac tgt tcc agt agg ata ctg aca 307 Ile Arg Cys Ala Phe Pro Glu Asn Asn Cys Ser Ser Arg Ile Leu Thr 55 60 65 act act cgc atc gtc aca gtt gct aag tac tgt tgc tca cct cac ctt 355 Thr Thr Arg Ile Val Thr Val Ala Lys Tyr Cys Cys Ser Pro His Leu 70 75 80 85 gac cgt gtg tat gca ctg gag cct ctc gat gca gct cac tct gag agc 403 Asp Arg Val Tyr Ala Leu Glu Pro Leu Asp Ala Ala His Ser Glu Ser 90 95 100 tta ttt tta agt agg att ttt ggt tcc gaa gat aga tgt cct ctc cat 451 Leu Phe Leu Ser Arg Ile Phe Gly Ser Glu Asp Arg Cys Pro Leu His 105 110 115 ctg aaa gaa gct tcc gac aaa ata ctg aaa aga tgt ggt ggc ctc ccc 499 Leu Lys Glu Ala Ser Asp Lys Ile Leu Lys Arg Cys Gly Gly Leu Pro 120 125 130 cta gcc tta 508 Leu Ala Leu 135 5 136 PRT Zea mays VARIANT (1)...(156) Xaa = Any Amino Acid 5 Met Lys Lys Ile Leu Arg His Ile Leu Cys His Arg Gly Cys Gly Gly 1 5 10 15 Ser Gln Glu Trp Asp Glu Gln Gln Leu Ile His Ala Val Arg Glu Phe 20 25 30 Leu Gln Asp Lys Arg Tyr Phe Val Val Ile Asp Asp Ile Trp Ser Thr 35 40 45 Ser Ala Trp Arg Ile Ile Arg Cys Ala Phe Pro Glu Asn Asn Cys Ser 50 55 60 Ser Arg Ile Leu Thr Thr Thr Arg Ile Val Thr Val Ala Lys Tyr Cys 65 70 75 80 Cys Ser Pro His Leu Asp Arg Val Tyr Ala Leu Glu Pro Leu Asp Ala 85 90 95 Ala His Ser Glu Ser Leu Phe Leu Ser Arg Ile Phe Gly Ser Glu Asp 100 105 110 Arg Cys Pro Leu His Leu Lys Glu Ala Ser Asp Lys Ile Leu Lys Arg 115 120 125 Cys Gly Gly Leu Pro Leu Ala Leu 130 135 6 498 DNA Zea mays CDS (1)...(498) 6 ggg ggg gtg ggg aag acg aca cta atc aga gaa cta tat gaa aag cca 48 Gly Gly Val Gly Lys Thr Thr Leu Ile Arg Glu Leu Tyr Glu Lys Pro 1 5 10 15 aca aca aag tca aag cag ttt gaa tgt caa gct tgg gtt agc ttc cca 96 Thr Thr Lys Ser Lys Gln Phe Glu Cys Gln Ala Trp Val Ser Phe Pro 20 25 30 ccg tat tta agt tct tca agc atc cta cag tta atc cat cag gaa ttg 144 Pro Tyr Leu Ser Ser Ser Ser Ile Leu Gln Leu Ile His Gln Glu Leu 35 40 45 gag gaa aca gat act tgg tgt ccc aga aaa caa gta gac aag aaa ctg 192 Glu Glu Thr Asp Thr Trp Cys Pro Arg Lys Gln Val Asp Lys Lys Leu 50 55 60 cac gaa ata ttg gat aag gac cgg ttc ttg ctc gtc ata gat gga gat 240 His Glu Ile Leu Asp Lys Asp Arg Phe Leu Leu Val Ile Asp Gly Asp 65 70 75 80 gtt agc aat agt gat tgt vat gcc atc ctt gcc gca tta cca gaa aag 288 Val Ser Asn Ser Asp Cys Xaa Ala Ile Leu Ala Ala Leu Pro Glu Lys 85 90 95 aac aac ggt agt agg att gtc cgt atc atg caa ggt ata cgt aag chr 336 Asn Asn Gly Ser Arg Ile Val Arg Ile Met Gln Gly Ile Arg Lys Xaa 100 105 110 cca cgt ggt att gct gca aaa cac tgg att gag ctg aaa tgt ttt gaa 384 Pro Arg Gly Ile Ala Ala Lys His Trp Ile Glu Leu Lys Cys Phe Glu 115 120 125 aca gaa aaa acc act agc sta ttc agc caa agg gtg tgc atg gaa gaa 432 Thr Glu Lys Thr Thr Ser Xaa Phe Ser Gln Arg Val Cys Met Glu Glu 130 135 140 aaa ata aat att gaa aac ttt gat gaa gtt ctg cat ggt ata acc aaa 480 Lys Ile Asn Ile Glu Asn Phe Asp Glu Val Leu His Gly Ile Thr Lys 145 150 155 160 ggc ctc ccc cta gcc tta 498 Gly Leu Pro Leu Ala Leu 165 7 166 PRT Zea mays VARIANT (1)...(166) Xaa = Any Amino Acid 7 Gly Gly Val Gly Lys Thr Thr Leu Ile Arg Glu Leu Tyr Glu Lys Pro 1 5 10 15 Thr Thr Lys Ser Lys Gln Phe Glu Cys Gln Ala Trp Val Ser Phe Pro 20 25 30 Pro Tyr Leu Ser Ser Ser Ser Ile Leu Gln Leu Ile His Gln Glu Leu 35 40 45 Glu Glu Thr Asp Thr Trp Cys Pro Arg Lys Gln Val Asp Lys Lys Leu 50 55 60 His Glu Ile Leu Asp Lys Asp Arg Phe Leu Leu Val Ile Asp Gly Asp 65 70 75 80 Val Ser Asn Ser Asp Cys Xaa Ala Ile Leu Ala Ala Leu Pro Glu Lys 85 90 95 Asn Asn Gly Ser Arg Ile Val Arg Ile Met Gln Gly Ile Arg Lys Xaa 100 105 110 Pro Arg Gly Ile Ala Ala Lys His Trp Ile Glu Leu Lys Cys Phe Glu 115 120 125 Thr Glu Lys Thr Thr Ser Xaa Phe Ser Gln Arg Val Cys Met Glu Glu 130 135 140 Lys Ile Asn Ile Glu Asn Phe Asp Glu Val Leu His Gly Ile Thr Lys 145 150 155 160 Gly Leu Pro Leu Ala Leu 165 8 515 DNA Oryza sp. CDS (3)...(515) 8 tg ggg ggg gtg ggg aag acg act ctt gct caa atg gtg tac aat gat 47 Gly Gly Val Gly Lys Thr Thr Leu Ala Gln Met Val Tyr Asn Asp 1 5 10 15 gaa aga gtc agc agg tac ttt caa cta aag ggc tgg gtg gat gtt tct 95 Glu Arg Val Ser Arg Tyr Phe Gln Leu Lys Gly Trp Val Asp Val Ser 20 25 30 gag ggt cat ttt gac gtt aag gca att gca aga aaa att atc atg ttg 143 Glu Gly His Phe Asp Val Lys Ala Ile Ala Arg Lys Ile Ile Met Leu 35 40 45 ttc act agg aat cca tgt gat ata gaa gat atg ggt aat ctc caa aat 191 Phe Thr Arg Asn Pro Cys Asp Ile Glu Asp Met Gly Asn Leu Gln Asn 50 55 60 atg ata acg gcg caa gta caa gac atg aag ttc ttt cta gtt ctt gac 239 Met Ile Thr Ala Gln Val Gln Asp Met Lys Phe Phe Leu Val Leu Asp 65 70 75 aat gtg tgg aat gtg cag aag gaa atc tgg gat gct ttg ctt tcc ctg 287 Asn Val Trp Asn Val Gln Lys Glu Ile Trp Asp Ala Leu Leu Ser Leu 80 85 90 95 ttg gtg ggt gct cag ttg gga atg atc tta tta aca aca cgc gat gaa 335 Leu Val Gly Ala Gln Leu Gly Met Ile Leu Leu Thr Thr Arg Asp Glu 100 105 110 act att tca aaa atg ata gga aca atg cct tct tat gat ctt agc ttc 383 Thr Ile Ser Lys Met Ile Gly Thr Met Pro Ser Tyr Asp Leu Ser Phe 115 120 125 cta act tct gaa gaa tca tgg cag ttg ttc aag cag atg gcc ttt gga 431 Leu Thr Ser Glu Glu Ser Trp Gln Leu Phe Lys Gln Met Ala Phe Gly 130 135 140 ttt ata gat caa cat atg gat caa cag ttc gaa gga ttt ggt agg aaa 479 Phe Ile Asp Gln His Met Asp Gln Gln Phe Glu Gly Phe Gly Arg Lys 145 150 155 att gtg gga aaa tgt gga ggc ctc ccc cta gcc tta 515 Ile Val Gly Lys Cys Gly Gly Leu Pro Leu Ala Leu 160 165 170 9 171 PRT Oryza sp. 9 Gly Gly Val Gly Lys Thr Thr Leu Ala Gln Met Val Tyr Asn Asp Glu 1 5 10 15 Arg Val Ser Arg Tyr Phe Gln Leu Lys Gly Trp Val Asp Val Ser Glu 20 25 30 Gly His Phe Asp Val Lys Ala Ile Ala Arg Lys Ile Ile Met Leu Phe 35 40 45 Thr Arg Asn Pro Cys Asp Ile Glu Asp Met Gly Asn Leu Gln Asn Met 50 55 60 Ile Thr Ala Gln Val Gln Asp Met Lys Phe Phe Leu Val Leu Asp Asn 65 70 75 80 Val Trp Asn Val Gln Lys Glu Ile Trp Asp Ala Leu Leu Ser Leu Leu 85 90 95 Val Gly Ala Gln Leu Gly Met Ile Leu Leu Thr Thr Arg Asp Glu Thr 100 105 110 Ile Ser Lys Met Ile Gly Thr Met Pro Ser Tyr Asp Leu Ser Phe Leu 115 120 125 Thr Ser Glu Glu Ser Trp Gln Leu Phe Lys Gln Met Ala Phe Gly Phe 130 135 140 Ile Asp Gln His Met Asp Gln Gln Phe Glu Gly Phe Gly Arg Lys Ile 145 150 155 160 Val Gly Lys Cys Gly Gly Leu Pro Leu Ala Leu 165 170 10 506 DNA Oryza sp. CDS (3)...(506) misc_feature (1)...(506) n = A,T,C or G 10 tg ggg ggg gtg ggg aag acg aca cta gct cag aaa ata ttc aat gat 47 Gly Gly Val Gly Lys Thr Thr Leu Ala Gln Lys Ile Phe Asn Asp 1 5 10 15 aaa aaa tta gaa ggg aga ttt gac cat cgt gcc tgg gtt tgt gtc tcc 95 Lys Lys Leu Glu Gly Arg Phe Asp His Arg Ala Trp Val Cys Val Ser 20 25 30 aag gag tat tct atg gtt tcc ctt ttg aca caa gtt ctt agt aat atg 143 Lys Glu Tyr Ser Met Val Ser Leu Leu Thr Gln Val Leu Ser Asn Met 35 40 45 aaa att cat tat gaa caa aat gaa tca gtt ggg aac ctt caa agc aaa 191 Lys Ile His Tyr Glu Gln Asn Glu Ser Val Gly Asn Leu Gln Ser Lys 50 55 60 ctc aaa gca ggc att gca gac aag agt ttt ttc ctt gtg ttg gat gat 239 Leu Lys Ala Gly Ile Ala Asp Lys Ser Phe Phe Leu Val Leu Asp Asp 65 70 75 gta tgg cac tat aaa gca tgg gaa gat tta cta aga act cca cta aat 287 Val Trp His Tyr Lys Ala Trp Glu Asp Leu Leu Arg Thr Pro Leu Asn 80 85 90 95 gcg gca gcc aca gga ata att cta gta act act cga gat gaa act att 335 Ala Ala Ala Thr Gly Ile Ile Leu Val Thr Thr Arg Asp Glu Thr Ile 100 105 110 gct cgt gta att ggg gtg gac cgg act cat aga gtt gat ttg atg tca 383 Ala Arg Val Ile Gly Val Asp Arg Thr His Arg Val Asp Leu Met Ser 115 120 125 gcc gat gta gga tgg gag tta ctt tgg agg agc atg aac atc aaa gan 431 Ala Asp Val Gly Trp Glu Leu Leu Trp Arg Ser Met Asn Ile Lys Xaa 130 135 140 gag aaa caa gtg aaa aat cta cgg gac aca ggt atc gag att gtc cgc 479 Glu Lys Gln Val Lys Asn Leu Arg Asp Thr Gly Ile Glu Ile Val Arg 145 150 155 aaa tgt ggt ggc ctc ccc cta gcc tta 506 Lys Cys Gly Gly Leu Pro Leu Ala Leu 160 165 11 168 PRT Oryza sp. VARIANT (1)...(168) Xaa = Any Amino Acid 11 Gly Gly Val Gly Lys Thr Thr Leu Ala Gln Lys Ile Phe Asn Asp Lys 1 5 10 15 Lys Leu Glu Gly Arg Phe Asp His Arg Ala Trp Val Cys Val Ser Lys 20 25 30 Glu Tyr Ser Met Val Ser Leu Leu Thr Gln Val Leu Ser Asn Met Lys 35 40 45 Ile His Tyr Glu Gln Asn Glu Ser Val Gly Asn Leu Gln Ser Lys Leu 50 55 60 Lys Ala Gly Ile Ala Asp Lys Ser Phe Phe Leu Val Leu Asp Asp Val 65 70 75 80 Trp His Tyr Lys Ala Trp Glu Asp Leu Leu Arg Thr Pro Leu Asn Ala 85 90 95 Ala Ala Thr Gly Ile Ile Leu Val Thr Thr Arg Asp Glu Thr Ile Ala 100 105 110 Arg Val Ile Gly Val Asp Arg Thr His Arg Val Asp Leu Met Ser Ala 115 120 125 Asp Val Gly Trp Glu Leu Leu Trp Arg Ser Met Asn Ile Lys Xaa Glu 130 135 140 Lys Gln Val Lys Asn Leu Arg Asp Thr Gly Ile Glu Ile Val Arg Lys 145 150 155 160 Cys Gly Gly Leu Pro Leu Ala Leu 165 12 518 DNA Oryza sp. CDS (3)...(518) misc_feature (1)...(518) n = A,T,C or G 12 tg ggg ggg gtg ggg aag acg act tta aca cag ctc gtc tac aat gat 47 Gly Gly Val Gly Lys Thr Thr Leu Thr Gln Leu Val Tyr Asn Asp 1 5 10 15 gtg aga gtn aag aag cat ttc cag tta aga atg tgg ctg tgt gtt tct 95 Val Arg Xaa Lys Lys His Phe Gln Leu Arg Met Trp Leu Cys Val Ser 20 25 30 gaa aac ttt gat gag gca gaa ctt acc aag gaa acg ata gaa tca gtt 143 Glu Asn Phe Asp Glu Ala Glu Leu Thr Lys Glu Thr Ile Glu Ser Val 35 40 45 gcg agc gga tta tca tcc gcc aca aca aac atg aac ttg ctt caa gaa 191 Ala Ser Gly Leu Ser Ser Ala Thr Thr Asn Met Asn Leu Leu Gln Glu 50 55 60 gac ctc tca aac aag ctg aaa ggc aaa agg ttt ctt cta gta ttg gat 239 Asp Leu Ser Asn Lys Leu Lys Gly Lys Arg Phe Leu Leu Val Leu Asp 65 70 75 gat gta tgg aat gag gat cct gat aga tgg gat aga tac cga cgt gct 287 Asp Val Trp Asn Glu Asp Pro Asp Arg Trp Asp Arg Tyr Arg Arg Ala 80 85 90 95 cta gtt gct ggt gca aaa gga agc aaa att atg gtg act act cga aat 335 Leu Val Ala Gly Ala Lys Gly Ser Lys Ile Met Val Thr Thr Arg Asn 100 105 110 gaa aat gtt ggg aaa tta atg ggc ggg ttg act cct tac tat cta aaa 383 Glu Asn Val Gly Lys Leu Met Gly Gly Leu Thr Pro Tyr Tyr Leu Lys 115 120 125 cag tta tca tac aat gat agc tgg cat tta ttc ata agc tat gca ttt 431 Gln Leu Ser Tyr Asn Asp Ser Trp His Leu Phe Ile Ser Tyr Ala Phe 130 135 140 gta gat ggt gac tcc agt gca cac cca agt ttg gaa atg atc ggc aag 479 Val Asp Gly Asp Ser Ser Ala His Pro Ser Leu Glu Met Ile Gly Lys 145 150 155 gaa att gtc cat aag ttg aaa ggc ctc ccc cta gcc tta 518 Glu Ile Val His Lys Leu Lys Gly Leu Pro Leu Ala Leu 160 165 170 13 172 PRT Oryza sp. VARIANT (1)...(172) Xaa = Any Amino Acid 13 Gly Gly Val Gly Lys Thr Thr Leu Thr Gln Leu Val Tyr Asn Asp Val 1 5 10 15 Arg Xaa Lys Lys His Phe Gln Leu Arg Met Trp Leu Cys Val Ser Glu 20 25 30 Asn Phe Asp Glu Ala Glu Leu Thr Lys Glu Thr Ile Glu Ser Val Ala 35 40 45 Ser Gly Leu Ser Ser Ala Thr Thr Asn Met Asn Leu Leu Gln Glu Asp 50 55 60 Leu Ser Asn Lys Leu Lys Gly Lys Arg Phe Leu Leu Val Leu Asp Asp 65 70 75 80 Val Trp Asn Glu Asp Pro Asp Arg Trp Asp Arg Tyr Arg Arg Ala Leu 85 90 95 Val Ala Gly Ala Lys Gly Ser Lys Ile Met Val Thr Thr Arg Asn Glu 100 105 110 Asn Val Gly Lys Leu Met Gly Gly Leu Thr Pro Tyr Tyr Leu Lys Gln 115 120 125 Leu Ser Tyr Asn Asp Ser Trp His Leu Phe Ile Ser Tyr Ala Phe Val 130 135 140 Asp Gly Asp Ser Ser Ala His Pro Ser Leu Glu Met Ile Gly Lys Glu 145 150 155 160 Ile Val His Lys Leu Lys Gly Leu Pro Leu Ala Leu 165 170 14 510 DNA Oryza sp. CDS (1)...(510) 14 ggg ggg gtg ggg aag acg acg gta gca cag atg gta tat aat gat gtc 48 Gly Gly Val Gly Lys Thr Thr Val Ala Gln Met Val Tyr Asn Asp Val 1 5 10 15 cgt gtt aga gaa cat ttt gag cac agt ggt tgg att cat gtc tcc cca 96 Arg Val Arg Glu His Phe Glu His Ser Gly Trp Ile His Val Ser Pro 20 25 30 aca ttt gat gtc cta agg ctg aca act gca ata act gag tca ttg acc 144 Thr Phe Asp Val Leu Arg Leu Thr Thr Ala Ile Thr Glu Ser Leu Thr 35 40 45 aag aga aac tgt ggc ttc aca cag cta agc cta gtt cat gag gtc ctc 192 Lys Arg Asn Cys Gly Phe Thr Gln Leu Ser Leu Val His Glu Val Leu 50 55 60 ctc aaa gaa ctg gat gga aag aaa tta ttt ttt gtg ctt gat gat gta 240 Leu Lys Glu Leu Asp Gly Lys Lys Leu Phe Phe Val Leu Asp Asp Val 65 70 75 80 tgg agc gaa tgt gaa agt tcc tgg cat gat tta ata cgt cca ctc agc 288 Trp Ser Glu Cys Glu Ser Ser Trp His Asp Leu Ile Arg Pro Leu Ser 85 90 95 tat gct cta aca gtc ata ata ttg gta aca aca agg agt aaa gag gtg 336 Tyr Ala Leu Thr Val Ile Ile Leu Val Thr Thr Arg Ser Lys Glu Val 100 105 110 gca cgt ctt gct gga aca gtg aag cca ttt tac ctt act gcc att cct 384 Ala Arg Leu Ala Gly Thr Val Lys Pro Phe Tyr Leu Thr Ala Ile Pro 115 120 125 aat gac gat tgc tgg cta ttg ttt cag cat ttt gct ttt ggg aag caa 432 Asn Asp Asp Cys Trp Leu Leu Phe Gln His Phe Ala Phe Gly Lys Gln 130 135 140 tgt gtg aat gaa aaa tcg agt tta gtt cag att ggc aag aaa att ttg 480 Cys Val Asn Glu Lys Ser Ser Leu Val Gln Ile Gly Lys Lys Ile Leu 145 150 155 160 cag aag tgt ggt ggc ctc ccc cta gcc tta 510 Gln Lys Cys Gly Gly Leu Pro Leu Ala Leu 165 170 15 170 PRT Oryza sp. 15 Gly Gly Val Gly Lys Thr Thr Val Ala Gln Met Val Tyr Asn Asp Val 1 5 10 15 Arg Val Arg Glu His Phe Glu His Ser Gly Trp Ile His Val Ser Pro 20 25 30 Thr Phe Asp Val Leu Arg Leu Thr Thr Ala Ile Thr Glu Ser Leu Thr 35 40 45 Lys Arg Asn Cys Gly Phe Thr Gln Leu Ser Leu Val His Glu Val Leu 50 55 60 Leu Lys Glu Leu Asp Gly Lys Lys Leu Phe Phe Val Leu Asp Asp Val 65 70 75 80 Trp Ser Glu Cys Glu Ser Ser Trp His Asp Leu Ile Arg Pro Leu Ser 85 90 95 Tyr Ala Leu Thr Val Ile Ile Leu Val Thr Thr Arg Ser Lys Glu Val 100 105 110 Ala Arg Leu Ala Gly Thr Val Lys Pro Phe Tyr Leu Thr Ala Ile Pro 115 120 125 Asn Asp Asp Cys Trp Leu Leu Phe Gln His Phe Ala Phe Gly Lys Gln 130 135 140 Cys Val Asn Glu Lys Ser Ser Leu Val Gln Ile Gly Lys Lys Ile Leu 145 150 155 160 Gln Lys Cys Gly Gly Leu Pro Leu Ala Leu 165 170 16 506 DNA Sorghum sp. CDS (3)...(506) 16 tt ggg ggg gtg ggg aag acg act cta gcc cag aag atc tat aat gac 47 Gly Gly Val Gly Lys Thr Thr Leu Ala Gln Lys Ile Tyr Asn Asp 1 5 10 15 cat aaa ata aaa gga agc ttt agt aaa caa gca tgg atc tgt gtt tct 95 His Lys Ile Lys Gly Ser Phe Ser Lys Gln Ala Trp Ile Cys Val Ser 20 25 30 caa caa tat tct gat att tca gtt ttg aaa gaa gtc ctt cgg aac atc 143 Gln Gln Tyr Ser Asp Ile Ser Val Leu Lys Glu Val Leu Arg Asn Ile 35 40 45 ggt gtt gat tat aag cat gat gaa act gtt gga gaa ctt agc aga agg 191 Gly Val Asp Tyr Lys His Asp Glu Thr Val Gly Glu Leu Ser Arg Arg 50 55 60 ctt gca ata gct gtc gaa aat gca agt ttc ttt ctt gtg ttg gat gat 239 Leu Ala Ile Ala Val Glu Asn Ala Ser Phe Phe Leu Val Leu Asp Asp 65 70 75 att tgg caa cat gag gtg tgg act aat tta ctc aga gcc cca tta aac 287 Ile Trp Gln His Glu Val Trp Thr Asn Leu Leu Arg Ala Pro Leu Asn 80 85 90 95 act gca gct aca gga aca att cta gta aca act cgt aat gat aca gtt 335 Thr Ala Ala Thr Gly Thr Ile Leu Val Thr Thr Arg Asn Asp Thr Val 100 105 110 gca cga gca att ggg gtg gaa gat att cat cga gta gaa ttg atg tca 383 Ala Arg Ala Ile Gly Val Glu Asp Ile His Arg Val Glu Leu Met Ser 115 120 125 gat gaa gta gga tgg aaa ttg ctt ttg aag agt atg aac att agc aaa 431 Asp Glu Val Gly Trp Lys Leu Leu Leu Lys Ser Met Asn Ile Ser Lys 130 135 140 gaa agt gaa gta gaa aac cta cga gtt tta ggg gtt gac att gtt cgt 479 Glu Ser Glu Val Glu Asn Leu Arg Val Leu Gly Val Asp Ile Val Arg 145 150 155 ttg tgt ggt ggc ctc ccc cta gcc cta 506 Leu Cys Gly Gly Leu Pro Leu Ala Leu 160 165 17 168 PRT Sorghum sp. 17 Gly Gly Val Gly Lys Thr Thr Leu Ala Gln Lys Ile Tyr Asn Asp His 1 5 10 15 Lys Ile Lys Gly Ser Phe Ser Lys Gln Ala Trp Ile Cys Val Ser Gln 20 25 30 Gln Tyr Ser Asp Ile Ser Val Leu Lys Glu Val Leu Arg Asn Ile Gly 35 40 45 Val Asp Tyr Lys His Asp Glu Thr Val Gly Glu Leu Ser Arg Arg Leu 50 55 60 Ala Ile Ala Val Glu Asn Ala Ser Phe Phe Leu Val Leu Asp Asp Ile 65 70 75 80 Trp Gln His Glu Val Trp Thr Asn Leu Leu Arg Ala Pro Leu Asn Thr 85 90 95 Ala Ala Thr Gly Thr Ile Leu Val Thr Thr Arg Asn Asp Thr Val Ala 100 105 110 Arg Ala Ile Gly Val Glu Asp Ile His Arg Val Glu Leu Met Ser Asp 115 120 125 Glu Val Gly Trp Lys Leu Leu Leu Lys Ser Met Asn Ile Ser Lys Glu 130 135 140 Ser Glu Val Glu Asn Leu Arg Val Leu Gly Val Asp Ile Val Arg Leu 145 150 155 160 Cys Gly Gly Leu Pro Leu Ala Leu 165 18 505 DNA Sorghum sp. CDS (2)...(505) 18 t ggg gga gtg ggg aag acg aca cta gct cag aaa tta tac aat gat caa 49 Gly Gly Val Gly Lys Thr Thr Leu Ala Gln Lys Leu Tyr Asn Asp Gln 1 5 10 15 aga cta aaa gga agc ttt gag aaa cat gca tgg att tgt gtt tcc cag 97 Arg Leu Lys Gly Ser Phe Glu Lys His Ala Trp Ile Cys Val Ser Gln 20 25 30 cag tac tct caa gtt cct ttg ttg aaa gaa ata ctt cga aat att gga 145 Gln Tyr Ser Gln Val Pro Leu Leu Lys Glu Ile Leu Arg Asn Ile Gly 35 40 45 gtg cag caa gag caa ggt gaa agt ttg gga gag ctg aag gcc aaa ctt 193 Val Gln Gln Glu Gln Gly Glu Ser Leu Gly Glu Leu Lys Ala Lys Leu 50 55 60 gca gaa gcc att aat gga aag aga ttt ttg ctt gtg ttg gat gac ttg 241 Ala Glu Ala Ile Asn Gly Lys Arg Phe Leu Leu Val Leu Asp Asp Leu 65 70 75 80 tgg gag tct gat gta tgg acc aat ttg cta aga act cca ctc gct gcc 289 Trp Glu Ser Asp Val Trp Thr Asn Leu Leu Arg Thr Pro Leu Ala Ala 85 90 95 gct gat caa gta aca att tta gta aca act cgg cat gat aca gtt gca 337 Ala Asp Gln Val Thr Ile Leu Val Thr Thr Arg His Asp Thr Val Ala 100 105 110 aag gca att gga gtt gga cat atg cac cga gtt gaa ttg ctg tca gag 385 Lys Ala Ile Gly Val Gly His Met His Arg Val Glu Leu Leu Ser Glu 115 120 125 gaa gta gga tgg gag tta cta tgg aag agt atg avc ata tct agt gag 433 Glu Val Gly Trp Glu Leu Leu Trp Lys Ser Met Xaa Ile Ser Ser Glu 130 135 140 aaa gaa gtg ctc aac ttg cgt gaa act gga att ggg att gtt caa aag 481 Lys Glu Val Leu Asn Leu Arg Glu Thr Gly Ile Gly Ile Val Gln Lys 145 150 155 160 tgt ggt ggc ctc ccc cta gcc tta 505 Cys Gly Gly Leu Pro Leu Ala Leu 165 19 168 PRT Sorghum sp. VARIANT (1)...(168) Xaa = Any Amino Acid 19 Gly Gly Val Gly Lys Thr Thr Leu Ala Gln Lys Leu Tyr Asn Asp Gln 1 5 10 15 Arg Leu Lys Gly Ser Phe Glu Lys His Ala Trp Ile Cys Val Ser Gln 20 25 30 Gln Tyr Ser Gln Val Pro Leu Leu Lys Glu Ile Leu Arg Asn Ile Gly 35 40 45 Val Gln Gln Glu Gln Gly Glu Ser Leu Gly Glu Leu Lys Ala Lys Leu 50 55 60 Ala Glu Ala Ile Asn Gly Lys Arg Phe Leu Leu Val Leu Asp Asp Leu 65 70 75 80 Trp Glu Ser Asp Val Trp Thr Asn Leu Leu Arg Thr Pro Leu Ala Ala 85 90 95 Ala Asp Gln Val Thr Ile Leu Val Thr Thr Arg His Asp Thr Val Ala 100 105 110 Lys Ala Ile Gly Val Gly His Met His Arg Val Glu Leu Leu Ser Glu 115 120 125 Glu Val Gly Trp Glu Leu Leu Trp Lys Ser Met Xaa Ile Ser Ser Glu 130 135 140 Lys Glu Val Leu Asn Leu Arg Glu Thr Gly Ile Gly Ile Val Gln Lys 145 150 155 160 Cys Gly Gly Leu Pro Leu Ala Leu 165 20 514 DNA Sorghum sp. misc_feature (1)...(514) n = A,T,C or G 20 tgggggggtg gggaagacga cactcacaca acatatatat gaagaagcaa agagccactt 60 ccaagtcctg gtatgggtat gcgtctctca gaatttcagt gcaagtnagt aggcacaaga 120 aatcgtaaaa caaatcccta aacttgacaa tgaaaatgga aatgaaagtg ctgaaggtct 180 gattgaaaaa agattgcagt ccaaacggtt cttgcttgtt ttggatgaca tgtggacaga 240 tcacgaagat gaatggaaaa aactgctagc cccatttaag aaaatgcaaa caaaaggtaa 300 catggcaata gtcacaacaa ggattccgaa ggtggcacga atggtagcaa cagtaggctg 360 tcaaataaga ttagaacggt taagtgatga agaatgcatg tgtttcttcc aagaatgtgt 420 gtttggtaac cgacaaacat gggaagggca tgctattttg catgatttcg ggtacaaaat 480 agtgaagaga ttgavaggsc tccccctagc ccta 514 21 609 DNA Sorghum sp. 21 tgggggggtg gggaagacga ctcttgccaa tgaagtgtac cagaagcttg aagggcaatt 60 tgactacaga gcttttgtgt cagtgtcaca aaaacctgac attaagaaga tattgaggca 120 tatactctgc cagtatagtt gccgagagtg tggcaacaat gaaatatggg atgagcagca 180 actcatcaac acaacaaggc agctccttaa ggataagcgg tatgcatcat atatttgttt 240 cttttcatga caatcttgct ttgaaggaat atacatgtgc ctatttattg atttgtgcaa 300 attcgtgtta ggtactttat tgtaattgat gatatatgga gcatatcagc atggagaact 360 atcagatgtg cttttcctga aaataattgt tccagtagaa tattgacaac tactcgcatc 420 atcacagtcg ctaagtattg ttgctctcct caccgcgacc atgtttatga actgaagcct 480 ctcgatgcag ctcactctaa gagcttattt tttaatagaa tttttggttc tgaagataga 540 tgtccccttc atctgaaaga agtttccaat ggaatattga aaaaatgtgg tggcctcccc 600 ttagcccta 609 22 517 DNA Sorghum sp. CDS (2)...(517) 22 t ggg ggg gtg ggg aag acg act ctt gcc aat gaa gtg tac cag aag ctt 49 Gly Gly Val Gly Lys Thr Thr Leu Ala Asn Glu Val Tyr Gln Lys Leu 1 5 10 15 gaa ggg caa ttt gac tac aga gct ttt gtg tca gtg tca caa aaa cct 97 Glu Gly Gln Phe Asp Tyr Arg Ala Phe Val Ser Val Ser Gln Lys Pro 20 25 30 gac att aag aag ata ttg agg cat ata ctc tgc cag tat agt tgc cga 145 Asp Ile Lys Lys Ile Leu Arg His Ile Leu Cys Gln Tyr Ser Cys Arg 35 40 45 gag tgt ggc aac aat gaa ata tgg gat gag cag caa ctc atc aac aca 193 Glu Cys Gly Asn Asn Glu Ile Trp Asp Glu Gln Gln Leu Ile Asn Thr 50 55 60 aca agg cag ctc ctt aag gat aag cgg tac ttt att gta att gat gat 241 Thr Arg Gln Leu Leu Lys Asp Lys Arg Tyr Phe Ile Val Ile Asp Asp 65 70 75 80 ata tgg agc ata tca gca tgg aga act atc aga tgt gct ttt cct gaa 289 Ile Trp Ser Ile Ser Ala Trp Arg Thr Ile Arg Cys Ala Phe Pro Glu 85 90 95 aat aat tgt tcc agt aga ata ttg aca act act cgc atc atc aca gtc 337 Asn Asn Cys Ser Ser Arg Ile Leu Thr Thr Thr Arg Ile Ile Thr Val 100 105 110 gct aag tat tgt tgc tct cct cac cgc gac cat gtt tat gaa ctg aag 385 Ala Lys Tyr Cys Cys Ser Pro His Arg Asp His Val Tyr Glu Leu Lys 115 120 125 cct ctc gat gca gct cac tct aag agc tta ttt ttt aat aga att ttt 433 Pro Leu Asp Ala Ala His Ser Lys Ser Leu Phe Phe Asn Arg Ile Phe 130 135 140 ggt tct gaa gat aga tgt ccc ctt cat ctg aaa gaa gtt tcc aat gga 481 Gly Ser Glu Asp Arg Cys Pro Leu His Leu Lys Glu Val Ser Asn Gly 145 150 155 160 ata ttg aaa aaa tgt ggt ggc ctc ccc tta gcc cta 517 Ile Leu Lys Lys Cys Gly Gly Leu Pro Leu Ala Leu 165 170 23 172 PRT Sorghum sp. 23 Gly Gly Val Gly Lys Thr Thr Leu Ala Asn Glu Val Tyr Gln Lys Leu 1 5 10 15 Glu Gly Gln Phe Asp Tyr Arg Ala Phe Val Ser Val Ser Gln Lys Pro 20 25 30 Asp Ile Lys Lys Ile Leu Arg His Ile Leu Cys Gln Tyr Ser Cys Arg 35 40 45 Glu Cys Gly Asn Asn Glu Ile Trp Asp Glu Gln Gln Leu Ile Asn Thr 50 55 60 Thr Arg Gln Leu Leu Lys Asp Lys Arg Tyr Phe Ile Val Ile Asp Asp 65 70 75 80 Ile Trp Ser Ile Ser Ala Trp Arg Thr Ile Arg Cys Ala Phe Pro Glu 85 90 95 Asn Asn Cys Ser Ser Arg Ile Leu Thr Thr Thr Arg Ile Ile Thr Val 100 105 110 Ala Lys Tyr Cys Cys Ser Pro His Arg Asp His Val Tyr Glu Leu Lys 115 120 125 Pro Leu Asp Ala Ala His Ser Lys Ser Leu Phe Phe Asn Arg Ile Phe 130 135 140 Gly Ser Glu Asp Arg Cys Pro Leu His Leu Lys Glu Val Ser Asn Gly 145 150 155 160 Ile Leu Lys Lys Cys Gly Gly Leu Pro Leu Ala Leu 165 170 24 605 DNA Sorghum sp. misc_feature (1)...(605) n = A,T,C or G 24 tgggggggtg gggaagacga ctcttgcgag ggcagtatac gacagccctc aagcaaagga 60 aaagtttcag tgccgtgctt gggttgctgc caccggtagc gatagctcgc cggagcagat 120 taggggtatc ctgcgtgata tacaccagca agttgttcca agagacacca tggattttga 180 caacaaccat cttgaggcat ccctcaagga atacctcagt gacaagaggt gtgtgctttt 240 gtttatgttt attccatgac aatctgatat ctagactatc ccaaatgttg gtatctatca 300 cttaagcttt aaaatcattg gttgataggt acttaattgt cattgatgac atccagatgg 360 atgaatggag aaccgtaaaa tcagtctttg aacacagcag cacaagcagc aggataatat 420 tgacgacaac tattcagcct atggctaata tgtgcagcag tcatggaaat ggttatgtct 480 accaaatgga cacccttggt gaagaagact ccaagaaaat agccttkcca gggatcaggt 540 caccngagct ggagcatggt tcagcagcgc tgctcagaaa atgtgatggc ctccccctag 600 cccta 605 25 505 DNA Sorghum sp. CDS (170)...(505) misc_feature (1)...(505) n = A,T,C or G 25 tgggggggtg gggaagacga ctcttgcgag ggcagtatac gacagccctc aagcaaagga 60 aaagtttcag tgccgtgctt gggttgctgc caccggtagc gatagctcgc cggagcagat 120 taggggtatc ctgcgtgata tacaccagca agttgttcca agagacacc atg gat ttt 178 Met Asp Phe 1 gac aac aac cat ctt gag gca tcc ctc aag gaa tac ctc agt gac aag 226 Asp Asn Asn His Leu Glu Ala Ser Leu Lys Glu Tyr Leu Ser Asp Lys 5 10 15 agg tac tta att gtc att gat gac atc cag atg gat gaa tgg aga acc 274 Arg Tyr Leu Ile Val Ile Asp Asp Ile Gln Met Asp Glu Trp Arg Thr 20 25 30 35 gta aaa tca gtc ttt gaa cac agc agc aca agc agc agg ata ata ttg 322 Val Lys Ser Val Phe Glu His Ser Ser Thr Ser Ser Arg Ile Ile Leu 40 45 50 acg aca act att cag cct atg gct aat atg tgc agc agt cat gga aat 370 Thr Thr Thr Ile Gln Pro Met Ala Asn Met Cys Ser Ser His Gly Asn 55 60 65 ggt tat gtc tac caa atg gac acc ctt ggt gaa gaa gac tcc aag aaa 418 Gly Tyr Val Tyr Gln Met Asp Thr Leu Gly Glu Glu Asp Ser Lys Lys 70 75 80 ata gcc ttk cca ggg atc agg tca ccn gag ctg gag cat ggt tca gca 466 Ile Ala Xaa Pro Gly Ile Arg Ser Xaa Glu Leu Glu His Gly Ser Ala 85 90 95 gcg ctg ctc aga aaa tgt gat ggc ctc ccc cta gcc cta 505 Ala Leu Leu Arg Lys Cys Asp Gly Leu Pro Leu Ala Leu 100 105 110 26 112 PRT Sorghum sp. VARIANT (1)...(112) Xaa = Any Amino Acid 26 Met Asp Phe Asp Asn Asn His Leu Glu Ala Ser Leu Lys Glu Tyr Leu 1 5 10 15 Ser Asp Lys Arg Tyr Leu Ile Val Ile Asp Asp Ile Gln Met Asp Glu 20 25 30 Trp Arg Thr Val Lys Ser Val Phe Glu His Ser Ser Thr Ser Ser Arg 35 40 45 Ile Ile Leu Thr Thr Thr Ile Gln Pro Met Ala Asn Met Cys Ser Ser 50 55 60 His Gly Asn Gly Tyr Val Tyr Gln Met Asp Thr Leu Gly Glu Glu Asp 65 70 75 80 Ser Lys Lys Ile Ala Xaa Pro Gly Ile Arg Ser Xaa Glu Leu Glu His 85 90 95 Gly Ser Ala Ala Leu Leu Arg Lys Cys Asp Gly Leu Pro Leu Ala Leu 100 105 110 27 1040 DNA Sorghum sp. misc_feature (1)...(1040) n = A,T,C or G 27 tgggggggtg gggaagacga cattggcaaa agaggtcagt cacaagntcc aaggcatttc 60 gattgtcatg cttttgtctc tgtctcgcaa cagccgaatg ttaagaaaat tatgaaggat 120 gtgatctctc aagtgccttg caaaaaggat tttacagaag atatcgacac ctgggatgaa 180 aagaaattta ttgggaagct tagagaactg ttacaagata agaggtaatt tcatgccccc 240 tttttttaga aaagaagaat ttattagttt tgctcctttt agtgttcaaa aaacctaaat 300 ataccacaga tcttgtaaat atgagaataa tagaatagtt gtctaagtgg aatatatttt 360 tttgtaatag acaaacttat ctgttgactt aatatttcct aatacagtag ttatgctaag 420 gtcttgttta cttccatcca aaacccaaaa attttcaaga ttctccgtca tatcgaatct 480 ttagacgcat gccwtggggt attaaatwta gmcgaaataa aagctaattg cacagtttgg 540 tcgaaattta cgagacgaat cctttgagcc tagttagtcc atgttgaatt tattttatca 600 tgttcttcaa attctatatt ttttcatgta tggcacttga attacttagc atggtgtagg 660 agtctcatgg tatgtctgta tttttttttc tacttaaact ccttcccata attggttaac 720 aatatttatt tatttctatc aggtacctca tcattattga tgatatatgg tctatactgg 780 catgggacgc tattaaatat gctttcccag agaataattt ttctagcaga attatagcaa 840 ctacacgcat cgttgatgta gcgaggtcat gttgtctcgg tggtaatgat cgcatgtacg 900 agatggaagc tctaagtggt cttcactcga aaaaattatt tttcaagaga acatttggat 960 ctgaagactg ttgcccggat gtgctaaaag aagtatcaaa tgaaatattg aaaaaatgtg 1020 gaggcctccc cttagcccta 1040 28 522 DNA Sorghum sp. CDS (112)...(522) misc_feature (1)...(522) n = A,T,C or G 28 tgggggggtg gggaagacga cattggcaaa agaggtcagt cacaagntcc aaggcatttc 60 gattgtcatg cttttgtctc tgtctcgcaa cagccgaatg ttaagaaaat t atg aag 117 Met Lys 1 gat gtg atc tct caa gtg cct tgc aaa aag gat ttt aca gaa gat atc 165 Asp Val Ile Ser Gln Val Pro Cys Lys Lys Asp Phe Thr Glu Asp Ile 5 10 15 gac acc tgg gat gaa aag aaa ttt att ggg aag ctt aga gaa ctg tta 213 Asp Thr Trp Asp Glu Lys Lys Phe Ile Gly Lys Leu Arg Glu Leu Leu 20 25 30 caa gat aag agg tac ctc atc att att gat gat ata tgg tct ata ctg 261 Gln Asp Lys Arg Tyr Leu Ile Ile Ile Asp Asp Ile Trp Ser Ile Leu 35 40 45 50 gca tgg gac gct att aaa tat gct ttc cca gag aat aat ttt tct agc 309 Ala Trp Asp Ala Ile Lys Tyr Ala Phe Pro Glu Asn Asn Phe Ser Ser 55 60 65 aga att ata gca act aca cgc atc gtt gat gta gcg agg tca tgt tgt 357 Arg Ile Ile Ala Thr Thr Arg Ile Val Asp Val Ala Arg Ser Cys Cys 70 75 80 ctc ggt ggt aat gat cgc atg tac gag atg gaa gct cta agt ggt ctt 405 Leu Gly Gly Asn Asp Arg Met Tyr Glu Met Glu Ala Leu Ser Gly Leu 85 90 95 cac tcg aaa aaa tta ttt ttc aag aga aca ttt gga tct gaa gac tgt 453 His Ser Lys Lys Leu Phe Phe Lys Arg Thr Phe Gly Ser Glu Asp Cys 100 105 110 tgc ccg gat gtg cta aaa gaa gta tca aat gaa ata ttg aaa aaa tgt 501 Cys Pro Asp Val Leu Lys Glu Val Ser Asn Glu Ile Leu Lys Lys Cys 115 120 125 130 gga ggc ctc ccc tta gcc cta 522 Gly Gly Leu Pro Leu Ala Leu 135 29 137 PRT Sorghum sp. 29 Met Lys Asp Val Ile Ser Gln Val Pro Cys Lys Lys Asp Phe Thr Glu 1 5 10 15 Asp Ile Asp Thr Trp Asp Glu Lys Lys Phe Ile Gly Lys Leu Arg Glu 20 25 30 Leu Leu Gln Asp Lys Arg Tyr Leu Ile Ile Ile Asp Asp Ile Trp Ser 35 40 45 Ile Leu Ala Trp Asp Ala Ile Lys Tyr Ala Phe Pro Glu Asn Asn Phe 50 55 60 Ser Ser Arg Ile Ile Ala Thr Thr Arg Ile Val Asp Val Ala Arg Ser 65 70 75 80 Cys Cys Leu Gly Gly Asn Asp Arg Met Tyr Glu Met Glu Ala Leu Ser 85 90 95 Gly Leu His Ser Lys Lys Leu Phe Phe Lys Arg Thr Phe Gly Ser Glu 100 105 110 Asp Cys Cys Pro Asp Val Leu Lys Glu Val Ser Asn Glu Ile Leu Lys 115 120 125 Lys Cys Gly Gly Leu Pro Leu Ala Leu 130 135 30 1044 DNA Sorghum sp. misc_feature (1)...(1044) n = A,T,C or G 30 tgggggggtg gggaagacga cattggcaaa agaggtcagt cacaagatcc aaggcatttc 60 gattgtcatg cttttgtctc tgtctcgcga cagccgaatg ttaagaaaat tatgaaggat 120 gtgatctctc aagtgccttg caaaaaggat tttacagagg atatcgacac ctgggatgaa 180 aagaaattta ttgggaagct tagagaactg ttacaagata agaggtaatt tcatgccccc 240 tttttttaga aaagaagaat ttatttgttt tgctcttttt agtgttcaaa aaaattaaat 300 ataccacaga tcttggtaaa tatgrgawta atagaattag ttgtctaagt ggaatatatt 360 ttttkgtaat agacaaactt atctgttgac ttaatatttc ctaatacagt agttatgcta 420 aggtcttgtt tacttccatc caaaacccaa aaattttcaa gattctccgt catatcgaat 480 ctttagacgc acgcatggag tattaaatat agacgaaaat aaaagctgat tgcacagttt 540 ggtcgaaatt tacgagacga atcttttgag cctagttagt ccatgttgaa tttattttat 600 catgttcttc aaattctata ttttttcatg tatggcactt gaattactta gcatggtgta 660 ggagtctcat ggtatgtctg tatttttttt ttctacttaa actccttccc ataattggtt 720 aacaatattt atttatttct atcaggtacc tbntcattat tgatgatata tggtctatac 780 tggcatggga cgctattaaa tatgctttcc cagagaataa tttttctagc agaattatag 840 caactacacg catcgttgat gtagcaaggt catgttgtct cggtggtaat gatcgcatgt 900 atgagatgga agctctaagt gatcttctct cgaaaaaatt atttttcaag agaacatttg 960 gatctgaaga ctgttgcccg gatgtgctaa aagaagtatc aaatgaaata ttgaaaaaaa 1020 tgtggaggcc tccccttagc ctta 1044 31 1038 DNA Sorghum sp. misc_feature (1)...(1038) n = A,T,C or G 31 tgggggggtg gggaagacga cattggcaaa agaggtcagt cactagatcc aaggcatttc 60 gattgtcatg cttttgtctc tgtctcgcaa cagccgaatg ttaagaaaat tatgaaggat 120 gtgatctctc aagtgccttg caaaaaggat tttacagaag atatcgacac ctgggatgaa 180 aagaaattta ttgggaagct tagagaactg ttacaagata agaggtaatt tcatgccccc 240 tttttttaga aaagaagaat ttattagttt tgctcttttt agtgttcaaa aaaattgaat 300 ataccacaga tcttgtaaat atgagaataa tagaatagtt gtctaagtgg aatatatttt 360 tttgtaatag acaaacttat ctgttgactt aatatttcct aatacagtag ttatgctaag 420 gtcttgttta cttccatcca aaacccaaaa attttcaaga ttctccgtca tatcgaatct 480 ttagacgcat gcmtggrgtw ttaaatatag acgaaaataa aagctaattg caccrgtttg 540 gtcgaaattt acgagacgaa tcttttgagc ctwgttagtc catgttgaat ttattttvtc 600 atgttcttca aattctatat tttttcatgt atggcacttg aattacttag catggtgtag 660 gagtctcatg gtatgtctgt attttttttt ctacttaaac tccttcccat aattggttaa 720 caatatttat ttatttctat caggtwcctc ntcactattg atgatatatg gtctatactg 780 gcatgggacg ctattaaata tgctttccca gagaataatt tttctagcag aattatagca 840 actacacgca tcgttgatgt agcaaggtca tgttgtctcg gtggtaatga tcgcatgtat 900 gagatggaag ctctaagtga tcttcactcg aaaaaattat ttttcaagag aacatttgga 960 tctgtagact gttgcccgga tgtgctaaag agtvtcaatg aatattgrga aatgtgggcc 1020 tcccctgcct tagcctta 1038 32 1043 DNA Sorghum sp. misc_feature (1)...(1043) n = A,T,C or G 32 tgggggggtg gggaagacga cattggcaaa ggaggtcagt cacaagatcc aaggcatttc 60 gattgtcatg cttttgtctc tgtctcgcaa cagccgaatg ttaagaaaat tatgaaggac 120 gtgatctctc aagtgccttg caaaaaggat tttacagaag atatcgacaa ctgggatgaa 180 aagaaattta ttgggaagct tagagaactg ttacaagata agaggtaatt tcatgccccc 240 tttttttaga aaagaagaat ttattagttt tgctcttttt agtgttcaaa aaaattaaat 300 ataccacaga tcttgtaaat atgrgawtaw trgaatagtt gtctaagtgg aataawattt 360 ttttgtaata gacaaactta tctgttgact taatattycc taatacagta gttatgctaa 420 ggtcttgttt acttccatcc aaaacccaaa aatttttcga gattctccgt catatcgaat 480 ctttagacgc atgcatggag tattaaatat agacgaaaat aaaagctaat tgcacagttt 540 ggtcgaaatt tacgagacga atcttttgag cctagttagt ccatgttgaa tttattttat 600 catgttcttc gaattctata ttttttcatg tatggcactt gaattactta gcatggtgta 660 ggagtctcat ggtatgtctg tatttttttt ttctacttaa actccatccc ataataggtt 720 aacaatattt atttatttct atcaggtacc tcmncattgt tgatgatata tggtctatac 780 tggcatggga cgctattaaa tatgctttcc cagagaataa tttttctagc agaattattg 840 caactacacg catcgttgat gtagcaaggt catgttgtct cggtggtaat gatcgcatgt 900 atgagatgga agctctaagt gatcttcact cgaaaaaatt atttttcaag agaacatttg 960 gatctgaaga ctgttgcccg gatgtgctaa aagaagtatc aaatgaaata ttgaaaaaat 1020 gtggaggcct ccccctagcc cta 1043 33 6760 DNA Artificial Sequence Sorghum sp. gene in BAC clone 33 aaaaaattac taactatacc tgtaatttgc gacataaatc ttttaaatct agttagtcca 60 tagttagrtt accattmccn aataaaacaa aatttcaaga ttttttcacc tcttttgggt 120 gtcatatcga atgtttcgta gaacatcgga agaggtcttt ggatactaat aaaaaaacta 180 attacataac tttgctgtaa actacaagac gaatttatta agtctaatta atctattatt 240 agtacatgta agttactata tgatttccaa ctatactata tacttaggcc ttgtttagtt 300 cccaccaaaa tacaaaaagt tttcaagatt cactgtcaca tcgaatcttt gcggcacatg 360 catggagcat gaaatataga taaaaaaata aataattgca caatttacct gtaaattgcg 420 aaatgaatct tttgaatcta ggtagtacat aattgaataa tatttgccaa atacaaacga 480 aagtgttaca ggagcgaaat tcaaaaaaat ttctcaacta accaagatct tagtttattt 540 tttatctata tttaatactc aatgcatgcg ctaaacattc aatgtgatga ttgaaaactt 600 aggacatgag acgtatttgg gaacccaaat ctaaaatgaa tctctaacac aataactata 660 gcctccatag agtaaccata tagaagactc attttgggta tcaggagagg cataaatcca 720 aatttgggta ccctctctcc tcgagaccca tttgtagaga atgttgtctt ttaggtcttg 780 ttgaagaaga ctaaaaatag gtatagagcg ctacacaaaa gacaaatgag tcttatattt 840 tgggtgatga ttgttgggag aaaatggcta ttataggact ccaatcgttg tggcttctct 900 caaatagaac tctaaatatt tgtttctccc aaataggact ctaaacattg tttgtaagct 960 ctaataacat tttaatacta ttaacccatg ttttaagtct ttaaatatat gtatttgacc 1020 atgacatgac tcttttgccc ctgggaccat tgtgccgccg acatagaacc acggcgagac 1080 gactctcact ttgccaccgg agaacgcgtt catggccacc ggagggagaa ggaacccatc 1140 gagcgccagc cccgcgaacg aggcgaggtc ctcccatagc gtagctggct ctggggccaa 1200 cacgaacggc gagagaagtc ctcccatgca gcgcagcakc agacatatcg ctcctctcgg 1260 ctgccgccgc cgctgtcccg tccttgctct gtcggcagcc gagctccgag cagaccaagc 1320 tagttgaagg aggcatagct gaagcacgaa cgccatcagc gtgggcaccc tcagcatcgt 1380 ctgcttcagc tcgtacgaca cttttctctt ggcgttggag gatgtgtagg gccgtgaaga 1440 tgcacggcag cgtcgtggag acaaccagca gcatgacact ctccaggtcc atccgcgaga 1500 tggatgcgga caccggcctg ctcaacgtac atcccgagta gagcagtaaa gtccaggggc 1560 aaaagagtca agccatggcc aaatacatat atttmaagac ttaaaacatg gctaataata 1620 ttaaaatgtt attagagctt acaaagaatg tttagagttc tatttgagag aaacaaatat 1680 ttggagtcct atttagaggt catataataa ccattttgtc ttagatttgg gaacgaaaaa 1740 aggccttacc ccgcaaggcc tccaattgcc tccctctctc tctttcvctc tctctccggc 1800 cacggtcgtc gctgcgccgc cgcacagagg ccggggcgtg gacgctacac gcatccgtyt 1860 ctcctcctgs tggccgcgct gtcttcctat tctccaaggg cttgattggg ggaatgtgat 1920 gtgataccgt cagcgatggt ccgtcaaccg acagggagat ttcgtcgggc gtctggcggg 1980 ctggaccaac gagacgacga ccggccactt ctttctcgct gggcggctca agaaatcgca 2040 ggtttccacg cagtccctgt cgttcgggct caaattgtcg ttacgtcaca ggcgagtagt 2100 tctttgacta atctttgatt gatttggcag ttgactgttg cagtttgata tctgatagtg 2160 atttgtccgc aggcggtggc tcaactgctc ggtgagagtg aaatatggct aatcgaggcc 2220 ctttacttgc tttcctgttg acgacttgag tagccttgaa gtttgcagta gtgtgccttg 2280 tgcaattccg cacttctttt cctatgattt cataagtgct tcatgtcttt aatttaatct 2340 tgacagattc agtgatcttt caaatctgct ctgtgcaatt tgagaacccc ctgtcttttg 2400 gcatgaccgc acactagcta gacaaggact gctgcttaga ggtcacatta ttactagcta 2460 gttgttttcc tcccttttat tggagagtgc tcatcttttc cttgctcatg tgtgcttggc 2520 agtaagcatc tttaagaggc tatactccct aaacctgctc ctttggaagg cactatattt 2580 gctaatttat ttagcattgt ttagcttgtt ggaaactggg gctgattggt atgaagtata 2640 aacaatgtga aataatattc tagtctgtac tgctcaaaca actctcactg tcctatgttg 2700 aattataatt tttttcaggt catgggcgtt catattgtca catattactg aaaaaatggg 2760 ccactgtgga aacaggtcca gtttacatag tagtcatctt agtagacact gacagatcct 2820 ctagtggttt gactaatcat agattccctt aagaaaatct ttacaacttt gctctaaacc 2880 tatcacctta tttatctatt ctttttaatc ttaaggttcc atgtcctttc cctgaatatt 2940 ataatttagg tatatgacct gtgatgtttg tcttctgcat tcacctaaat tatttttagg 3000 gatatacaaa tcttttatca ttaggtggct tcactagtaa gacacacatt ttatcagctt 3060 tatcttctaa ctagttgatt gaccttgcaa tgtgggtgat tcacactttt ttcacacttg 3120 cattcttgac cacttgacta gtcttgaaat gctttttgtg cacctttttt tatgcatcaa 3180 ggaagtgatt gttgtgtaca ccttacctgg cttacctgct tactcctttt attctgttct 3240 tatacttcag tcgcttcctt ttcttgtgca tgtggagttg acttactgag catgtataag 3300 aagagccctc agacatgaaa gttgcacagg cacaaatact gacaccagaa taccagatgc 3360 agtgaggggt gggggatgga agctattgtg tgtgcatcac atggagccat gggctccttg 3420 ctgtggaagc tgggtgcatt gctctctgat gagtacaagc ttctaagtag tgtgaaggtg 3480 gatatgatgt tccttaaagc tgagcttgag gtcatgcatg ctttcctgaa gaagatgtcg 3540 gaggtggagg ttcctgatga gcagtccaag tgctggatga aggaggtgcg ggagctgtcc 3600 tacgacattg aggacagcat cgacagtttc atattttctc ttggttgtga gtccaacagc 3660 gaacctagag gcttcaaggg gtttgttggt aggtgcttga acttgtttgc cgatgctaag 3720 acacgtcatt ggattgccaa gaagatccaa cgtctcaaat gccatgttat agaggctagc 3780 aatcggcgtg ggaggtacag ggttgatgat gctgtcccca gattgagtag aacaagcata 3840 gaccctcgct tgccggcatt gtacactgag acaacaaggc ttgttggtgt tgatggccca 3900 agggacaaac ttatcaagtt gctaacagaa agagagggca caacgacaca gctgagtgtg 3960 gtctccattg ttggatttgg tgggcttggg aagactactc ttgccaatga agtgtaccag 4020 aagcttgaag ggcaatttga ctacagagct tttgtgtcag tgtcacaaaa acctgacatt 4080 aagaagatat tgaggcatat actctgccag tatagttgcc gagagtgtgg caacaatgaa 4140 atatgggatg agcagcaact catcaacaca acaaggcagt tccttaagga taagcggtat 4200 gcatcatata tttgtttctt ttcatgacaa tcttgctttg aaggaatata catgtgccta 4260 tttattgatt tgtgcaaatt cgtgttaggt actttattgt aattgatgat atatggagca 4320 tatcagcatg gagaactatc agatgtgctt ttcctgaaaa taattgttcc agtagaatat 4380 tgacaactac tcgcatcatc acagtcgcta agtattgttg ctctcctcac cgcgaccatg 4440 tttatgaact gaagcctctc gatgcagctc actctaagag cttatttttt aatagaattt 4500 ttggttctga agatagatgt cctcttcatc tgaaagaagt ttccaatgga atattgaaaa 4560 aatgtggtgg cttaccattg gcaatcatta ccgtagctag tttattggtc actaaagcta 4620 ttacaaaaga agaatgggag aagatgctga agtctattgg ttcagcactt gaaaaagata 4680 cagatatgga agagatgaag aagattcttt tgcttagtta caatgatctc ccctaccact 4740 tgaagacatg cttgttatat ctaggtgtgt ttcctgaaga ttatgagatt aagagggatc 4800 gactgataag aagatggatt gctgaaggtt ttatcactac agaaggtgga caagacatgg 4860 aggaaatagg agaatgctat ttcaatgaac ttatcaacag gagcatgatt cagcctgttg 4920 gaattcagta tgatggtcgg gctgatgctt gccgtgtcca tgatatgatt cttgatctca 4980 tcatatccaa atcagttgaa gaaaactttc taaccttatg tggtgatgga aatcacaagc 5040 tgttgcaaca ggataaggtt cgtcggctat ccatcaatta tcatgctcga gatgatatta 5100 tagtaccaac aaatatgatt gtttctaatg ttcgatccct cactatattt gggtatgatg 5160 agaatatgcc tggtctttcc aactttctac tcttgcgagt tctggatcta gaaaatagag 5220 tggtgctgga atacaattac cttagacaca taggtaggct ttctcagttg aggtacctcc 5280 gactcagttc aagaagaata actgcacttc ctgaacaaat aggagatcta caaaacttgc 5340 agaccttaga tctgaggtgg acaaggatta aaagattgcc acagagtgtt gtcctcctac 5400 gacgattgac atgcttgttg gttaacagtt tagaattgcc tgaaggaatt ggaaatatgc 5460 aagctctaca ggagttatca gagattgaaa ttaactgtca tacatcagtg tcctctttgc 5520 tggaactggg caaattgact aatataagaa ttcttgggct aaattggtgc atcctcgata 5580 caaattatgt aacaaaaatc catgcagata gtttggttat gtccctctgc aaattaggca 5640 tgctcaatct tcgatcgata cagattcaaa gctatcatag ttgttccctt gatttcttaa 5700 aggattcttg gtttcctcct cctcgtcgcc tccaaatatt tgatatgtcc atagattacc 5760 atttccccag aattccaaac tggataatct cactggagta cctcagttac ctagacatat 5820 atcttactcc agtggatgag gaatcatttc gaaccctggg ggatttgccc tctctgttgt 5880 tcctctggat atcctcaaga gaagcaaagc ctaaagaagg ggttattgtc agcagcaatg 5940 gtttccgctg tctgaaggag ttctacttca cctgttggga aattggtaca gggctgtctt 6000 ttgaaccagg agccatgcca atgcttgaaa aactgcggat tccattcaac gcgcatggtg 6060 tgtgctcttt gcacggtgtt ctggattctg gaatctggca cctctgttcc ctaaggcatc 6120 tccatgtcga gattatttgt catggtgcga ggcttaagga ggtggaggcc gtggaggaag 6180 ctgtcaagaa tgcagccagc tacctttctg atgagctctt ccttgatgta agaagatggg 6240 acgaagaaga gattttgaag gacgaggagc ataaactgga ggaagaggag tttagttctt 6300 atgattcaaa aagcataact caatactgaa gtttgatttc ttctcttaat cacaagcatg 6360 cacatgtatt tctgtttcta ctgactagca cattgtaggg tgcctacact attgacttta 6420 gcgcttgtga agagctgatt cggcacgatc atgattcaaa caaattagtg gtccattata 6480 ttcagtgaca caataatttg ttcttccttt ttcaaatttg aagtttggga aaatatacta 6540 gatgcaactg caagcatttt ggaacccctg ctcctctgat tacttggtcc ttaaattctc 6600 ctctaattgc ttgaaggctt gcttgtcctt caaatctgaa gctttctgtc cttcaaatct 6660 gaagctttct aaaagctggt taccgtggtc cccggggact ccttcaaatt tgaagttttt 6720 ttttctcgaa tccttcaaat tcgaatcaag aactgtctgc 6760 34 2954 DNA Sorghum sp. 34 atggaagcta ttgtgtgtgc atcacatgga gccatgggct ccttgctgtg gaagctgggt 60 gcattgctct ctgatgagta caagcttcta agtagtgtga aggtggatat gatgttcctt 120 aaagctgagc ttgaggtcat gcatgctttc ctgaagaaga tgtcggaggt ggaggttcct 180 gatgagcagt ccaagtgctg gatgaaggag gtgcgggagc tgtcctacga cattgaggac 240 agcatcgaca gtttcatatt ttctcttggt tgtgagtcca acagcgaacc tagaggcttc 300 aaggggtttg ttggtaggtg cttgaacttg tttgccgatg ctaagacacg tcattggatt 360 gccaagaaga tccaacgtct caaatgccat gttatagagg ctagcaatcg gcgtgggagg 420 tacagggttg atgatgctgt ccccagattg agtagaacaa gcatagaccc tcgcttgccg 480 gcattgtaca ctgagacaac aaggcttgtt ggtgttgatg gcccaaggga caaacttatc 540 aagttgctaa cagaaagaga gggcacaacg acacagctga gtgtggtctc cattgttgga 600 tttggtgggc ttgggaagac tactcttgcc aatgaagtgt accagaagct tgaagggcaa 660 tttgactaca gagcttttgt gtcagtgtca caaaaacctg acattaagaa gatattgagg 720 catatactct gccagtatag ttgccgagag tgtggcaaca atgaaatatg ggatgagcag 780 caactcatca acacaacaag gcagttcctt aaggataagc ggtatgcatc atatatttgt 840 ttcttttcat gacaatcttg ctttgaagga atatacatgt gcctatttat tgatttgtgc 900 aaattcgtgt taggtacttt attgtaattg atgatatatg gagcatatca gcatggagaa 960 ctatcagatg tgcttttcct gaaaataatt gttccagtag aatattgaca actactcgca 1020 tcatcacagt cgctaagtat tgttgctctc ctcaccgcga ccatgtttat gaactgaagc 1080 ctctcgatgc agctcactct aagagcttat tttttaatag aatttttggt tctgaagata 1140 gatgtcctct tcatctgaaa gaagtttcca atggaatatt gaaaaaatgt ggtggcttac 1200 cattggcaat cattaccgta gctagtttat tggtcactaa agctattaca aaagaagaat 1260 gggagaagat gctgaagtct attggttcag cacttgaaaa agatacagat atggaagaga 1320 tgaagaagat tcttttgctt agttacaatg atctccccta ccacttgaag acatgcttgt 1380 tatatctagg tgtgtttcct gaagattatg agattaagag ggatcgactg ataagaagat 1440 ggattgctga aggttttatc actacagaag gtggacaaga catggaggaa ataggagaat 1500 gctatttcaa tgaacttatc aacaggagca tgattcagcc tgttggaatt cagtatgatg 1560 gtcgggctga tgcttgccgt gtccatgata tgattcttga tctcatcata tccaaatcag 1620 ttgaagaaaa ctttctaacc ttatgtggtg atggaaatca caagctgttg caacaggata 1680 aggttcgtcg gctatccatc aattatcatg ctcgagatga tattatagta ccaacaaata 1740 tgattgtttc taatgttcga tccctcacta tatttgggta tgatgagaat atgcctggtc 1800 tttccaactt tctactcttg cgagttctgg atctagaaaa tagagtggtg ctggaataca 1860 attaccttag acacataggt aggctttctc agttgaggta cctccgactc agttcaagaa 1920 gaataactgc acttcctgaa caaataggag atctacaaaa cttgcagacc ttagatctga 1980 ggtggacaag gattaaaaga ttgccacaga gtgttgtcct cctacgacga ttgacatgct 2040 tgttggttaa cagtttagaa ttgcctgaag gaattggaaa tatgcaagct ctacaggagt 2100 tatcagagat tgaaattaac tgtcatacat cagtgtcctc tttgctggaa ctgggcaaat 2160 tgactaatat aagaattctt gggctaaatt ggtgcatcct cgatacaaat tatgtaacaa 2220 aaatccatgc agatagtttg gttatgtccc tctgcaaatt aggcatgctc aatcttcgat 2280 cgatacagat tcaaagctat catagttgtt cccttgattt cttaaaggat tcttggtttc 2340 ctcctcctcg tcgcctccaa atatttgata tgtccataga ttaccatttc cccagaattc 2400 caaactggat aatctcactg gagtacctca gttacctaga catatatctt actccagtgg 2460 atgaggaatc atttcgaacc ctgggggatt tgccctctct gttgttcctc tggatatcct 2520 caagagaagc aaagcctaaa gaaggggtta ttgtcagcag caatggtttc cgctgtctga 2580 aggagttcta cttcacctgt tgggaaattg gtacagggct gtcttttgaa ccaggagcca 2640 tgccaatgct tgaaaaactg cggattccat tcaacgcgca tggtgtgtgc tctttgcacg 2700 gtgttctgga ttctggaatc tggcacctct gttccctaag gcatctccat gtcgagatta 2760 tttgtcatgg tgcgaggctt aaggaggtgg aggccgtgga ggaagctgtc aagaatgcag 2820 ccagctacct ttctgatgag ctcttccttg atgtaagaag atgggacgaa gaagagattt 2880 tgaaggacga ggagcataaa ctggaggaag aggagtttag ttcttatgat tcaaaaagca 2940 taactcaata ctga 2954 35 2862 DNA Sorghum sp. CDS (1)...(2859) 35 atg gaa gct att gtg tgt gca tca cat gga gcc atg ggc tcc ttg ctg 48 Met Glu Ala Ile Val Cys Ala Ser His Gly Ala Met Gly Ser Leu Leu 1 5 10 15 tgg aag ctg ggt gca ttg ctc tct gat gag tac aag ctt cta agt agt 96 Trp Lys Leu Gly Ala Leu Leu Ser Asp Glu Tyr Lys Leu Leu Ser Ser 20 25 30 gtg aag gtg gat atg atg ttc ctt aaa gct gag ctt gag gtc atg cat 144 Val Lys Val Asp Met Met Phe Leu Lys Ala Glu Leu Glu Val Met His 35 40 45 gct ttc ctg aag aag atg tcg gag gtg gag gtt cct gat gag cag tcc 192 Ala Phe Leu Lys Lys Met Ser Glu Val Glu Val Pro Asp Glu Gln Ser 50 55 60 aag tgc tgg atg aag gag gtg cgg gag ctg tcc tac gac att gag gac 240 Lys Cys Trp Met Lys Glu Val Arg Glu Leu Ser Tyr Asp Ile Glu Asp 65 70 75 80 agc atc gac agt ttc ata ttt tct ctt ggt tgt gag tcc aac agc gaa 288 Ser Ile Asp Ser Phe Ile Phe Ser Leu Gly Cys Glu Ser Asn Ser Glu 85 90 95 cct aga ggc ttc aag ggg ttt gtt ggt agg tgc ttg aac ttg ttt gcc 336 Pro Arg Gly Phe Lys Gly Phe Val Gly Arg Cys Leu Asn Leu Phe Ala 100 105 110 gat gct aag aca cgt cat tgg att gcc aag aag atc caa cgt ctc aaa 384 Asp Ala Lys Thr Arg His Trp Ile Ala Lys Lys Ile Gln Arg Leu Lys 115 120 125 tgc cat gtt ata gag gct agc aat cgg cgt ggg agg tac agg gtt gat 432 Cys His Val Ile Glu Ala Ser Asn Arg Arg Gly Arg Tyr Arg Val Asp 130 135 140 gat gct gtc ccc aga ttg agt aga aca agc ata gac cct cgc ttg ccg 480 Asp Ala Val Pro Arg Leu Ser Arg Thr Ser Ile Asp Pro Arg Leu Pro 145 150 155 160 gca ttg tac act gag aca aca agg ctt gtt ggt gtt gat ggc cca agg 528 Ala Leu Tyr Thr Glu Thr Thr Arg Leu Val Gly Val Asp Gly Pro Arg 165 170 175 gac aaa ctt atc aag ttg cta aca gaa aga gag ggc aca acg aca cag 576 Asp Lys Leu Ile Lys Leu Leu Thr Glu Arg Glu Gly Thr Thr Thr Gln 180 185 190 ctg agt gtg gtc tcc att gtt gga ttt ggt ggg ctt ggg aag act act 624 Leu Ser Val Val Ser Ile Val Gly Phe Gly Gly Leu Gly Lys Thr Thr 195 200 205 ctt gcc aat gaa gtg tac cag aag ctt gaa ggg caa ttt gac tac aga 672 Leu Ala Asn Glu Val Tyr Gln Lys Leu Glu Gly Gln Phe Asp Tyr Arg 210 215 220 gct ttt gtg tca gtg tca caa aaa cct gac att aag aag ata ttg agg 720 Ala Phe Val Ser Val Ser Gln Lys Pro Asp Ile Lys Lys Ile Leu Arg 225 230 235 240 cat ata ctc tgc cag tat agt tgc cga gag tgt ggc aac aat gaa ata 768 His Ile Leu Cys Gln Tyr Ser Cys Arg Glu Cys Gly Asn Asn Glu Ile 245 250 255 tgg gat gag cag caa ctc atc aac aca aca agg cag ttc ctt aag gat 816 Trp Asp Glu Gln Gln Leu Ile Asn Thr Thr Arg Gln Phe Leu Lys Asp 260 265 270 aag cgg tac ttt att gta att gat gat ata tgg agc ata tca gca tgg 864 Lys Arg Tyr Phe Ile Val Ile Asp Asp Ile Trp Ser Ile Ser Ala Trp 275 280 285 aga act atc aga tgt gct ttt cct gaa aat aat tgt tcc agt aga ata 912 Arg Thr Ile Arg Cys Ala Phe Pro Glu Asn Asn Cys Ser Ser Arg Ile 290 295 300 ttg aca act act cgc atc atc aca gtc gct aag tat tgt tgc tct cct 960 Leu Thr Thr Thr Arg Ile Ile Thr Val Ala Lys Tyr Cys Cys Ser Pro 305 310 315 320 cac cgc gac cat gtt tat gaa ctg aag cct ctc gat gca gct cac tct 1008 His Arg Asp His Val Tyr Glu Leu Lys Pro Leu Asp Ala Ala His Ser 325 330 335 aag agc tta ttt ttt aat aga att ttt ggt tct gaa gat aga tgt cct 1056 Lys Ser Leu Phe Phe Asn Arg Ile Phe Gly Ser Glu Asp Arg Cys Pro 340 345 350 ctt cat ctg aaa gaa gtt tcc aat gga ata ttg aaa aaa tgt ggt ggc 1104 Leu His Leu Lys Glu Val Ser Asn Gly Ile Leu Lys Lys Cys Gly Gly 355 360 365 tta cca ttg gca atc att acc gta gct agt tta ttg gtc act aaa gct 1152 Leu Pro Leu Ala Ile Ile Thr Val Ala Ser Leu Leu Val Thr Lys Ala 370 375 380 att aca aaa gaa gaa tgg gag aag atg ctg aag tct att ggt tca gca 1200 Ile Thr Lys Glu Glu Trp Glu Lys Met Leu Lys Ser Ile Gly Ser Ala 385 390 395 400 ctt gaa aaa gat aca gat atg gaa gag atg aag aag att ctt ttg ctt 1248 Leu Glu Lys Asp Thr Asp Met Glu Glu Met Lys Lys Ile Leu Leu Leu 405 410 415 agt tac aat gat ctc ccc tac cac ttg aag aca tgc ttg tta tat cta 1296 Ser Tyr Asn Asp Leu Pro Tyr His Leu Lys Thr Cys Leu Leu Tyr Leu 420 425 430 ggt gtg ttt cct gaa gat tat gag att aag agg gat cga ctg ata aga 1344 Gly Val Phe Pro Glu Asp Tyr Glu Ile Lys Arg Asp Arg Leu Ile Arg 435 440 445 aga tgg att gct gaa ggt ttt atc act aca gaa ggt gga caa gac atg 1392 Arg Trp Ile Ala Glu Gly Phe Ile Thr Thr Glu Gly Gly Gln Asp Met 450 455 460 gag gaa ata gga gaa tgc tat ttc aat gaa ctt atc aac agg agc atg 1440 Glu Glu Ile Gly Glu Cys Tyr Phe Asn Glu Leu Ile Asn Arg Ser Met 465 470 475 480 att cag cct gtt gga att cag tat gat ggt cgg gct gat gct tgc cgt 1488 Ile Gln Pro Val Gly Ile Gln Tyr Asp Gly Arg Ala Asp Ala Cys Arg 485 490 495 gtc cat gat atg att ctt gat ctc atc ata tcc aaa tca gtt gaa gaa 1536 Val His Asp Met Ile Leu Asp Leu Ile Ile Ser Lys Ser Val Glu Glu 500 505 510 aac ttt cta acc tta tgt ggt gat gga aat cac aag ctg ttg caa cag 1584 Asn Phe Leu Thr Leu Cys Gly Asp Gly Asn His Lys Leu Leu Gln Gln 515 520 525 gat aag gtt cgt cgg cta tcc atc aat tat cat gct cga gat gat att 1632 Asp Lys Val Arg Arg Leu Ser Ile Asn Tyr His Ala Arg Asp Asp Ile 530 535 540 ata gta cca aca aat atg att gtt tct aat gtt cga tcc ctc act ata 1680 Ile Val Pro Thr Asn Met Ile Val Ser Asn Val Arg Ser Leu Thr Ile 545 550 555 560 ttt ggg tat gat gag aat atg cct ggt ctt tcc aac ttt cta ctc ttg 1728 Phe Gly Tyr Asp Glu Asn Met Pro Gly Leu Ser Asn Phe Leu Leu Leu 565 570 575 cga gtt ctg gat cta gaa aat aga gtg gtg ctg gaa tac aat tac ctt 1776 Arg Val Leu Asp Leu Glu Asn Arg Val Val Leu Glu Tyr Asn Tyr Leu 580 585 590 aga cac ata ggt agg ctt tct cag ttg agg tac ctc cga ctc agt tca 1824 Arg His Ile Gly Arg Leu Ser Gln Leu Arg Tyr Leu Arg Leu Ser Ser 595 600 605 aga aga ata act gca ctt cct gaa caa ata gga gat cta caa aac ttg 1872 Arg Arg Ile Thr Ala Leu Pro Glu Gln Ile Gly Asp Leu Gln Asn Leu 610 615 620 cag acc tta gat ctg agg tgg aca agg att aaa aga ttg cca cag agt 1920 Gln Thr Leu Asp Leu Arg Trp Thr Arg Ile Lys Arg Leu Pro Gln Ser 625 630 635 640 gtt gtc ctc cta cga cga ttg aca tgc ttg ttg gtt aac agt tta gaa 1968 Val Val Leu Leu Arg Arg Leu Thr Cys Leu Leu Val Asn Ser Leu Glu 645 650 655 ttg cct gaa gga att gga aat atg caa gct cta cag gag tta tca gag 2016 Leu Pro Glu Gly Ile Gly Asn Met Gln Ala Leu Gln Glu Leu Ser Glu 660 665 670 att gaa att aac tgt cat aca tca gtg tcc tct ttg ctg gaa ctg ggc 2064 Ile Glu Ile Asn Cys His Thr Ser Val Ser Ser Leu Leu Glu Leu Gly 675 680 685 aaa ttg act aat ata aga att ctt ggg cta aat tgg tgc atc ctc gat 2112 Lys Leu Thr Asn Ile Arg Ile Leu Gly Leu Asn Trp Cys Ile Leu Asp 690 695 700 aca aat tat gta aca aaa atc cat gca gat agt ttg gtt atg tcc ctc 2160 Thr Asn Tyr Val Thr Lys Ile His Ala Asp Ser Leu Val Met Ser Leu 705 710 715 720 tgc aaa tta ggc atg ctc aat ctt cga tcg ata cag att caa agc tat 2208 Cys Lys Leu Gly Met Leu Asn Leu Arg Ser Ile Gln Ile Gln Ser Tyr 725 730 735 cat agt tgt tcc ctt gat ttc tta aag gat tct tgg ttt cct cct cct 2256 His Ser Cys Ser Leu Asp Phe Leu Lys Asp Ser Trp Phe Pro Pro Pro 740 745 750 cgt cgc ctc caa ata ttt gat atg tcc ata gat tac cat ttc ccc aga 2304 Arg Arg Leu Gln Ile Phe Asp Met Ser Ile Asp Tyr His Phe Pro Arg 755 760 765 att cca aac tgg ata atc tca ctg gag tac ctc agt tac cta gac ata 2352 Ile Pro Asn Trp Ile Ile Ser Leu Glu Tyr Leu Ser Tyr Leu Asp Ile 770 775 780 tat ctt act cca gtg gat gag gaa tca ttt cga acc ctg ggg gat ttg 2400 Tyr Leu Thr Pro Val Asp Glu Glu Ser Phe Arg Thr Leu Gly Asp Leu 785 790 795 800 ccc tct ctg ttg ttc ctc tgg ata tcc tca aga gaa gca aag cct aaa 2448 Pro Ser Leu Leu Phe Leu Trp Ile Ser Ser Arg Glu Ala Lys Pro Lys 805 810 815 gaa ggg gtt att gtc agc agc aat ggt ttc cgc tgt ctg aag gag ttc 2496 Glu Gly Val Ile Val Ser Ser Asn Gly Phe Arg Cys Leu Lys Glu Phe 820 825 830 tac ttc acc tgt tgg gaa att ggt aca ggg ctg tct ttt gaa cca gga 2544 Tyr Phe Thr Cys Trp Glu Ile Gly Thr Gly Leu Ser Phe Glu Pro Gly 835 840 845 gcc atg cca atg ctt gaa aaa ctg cgg att cca ttc aac gcg cat ggt 2592 Ala Met Pro Met Leu Glu Lys Leu Arg Ile Pro Phe Asn Ala His Gly 850 855 860 gtg tgc tct ttg cac ggt gtt ctg gat tct gga atc tgg cac ctc tgt 2640 Val Cys Ser Leu His Gly Val Leu Asp Ser Gly Ile Trp His Leu Cys 865 870 875 880 tcc cta agg cat ctc cat gtc gag att att tgt cat ggt gcg agg ctt 2688 Ser Leu Arg His Leu His Val Glu Ile Ile Cys His Gly Ala Arg Leu 885 890 895 aag gag gtg gag gcc gtg gag gaa gct gtc aag aat gca gcc agc tac 2736 Lys Glu Val Glu Ala Val Glu Glu Ala Val Lys Asn Ala Ala Ser Tyr 900 905 910 ctt tct gat gag ctc ttc ctt gat gta aga aga tgg gac gaa gaa gag 2784 Leu Ser Asp Glu Leu Phe Leu Asp Val Arg Arg Trp Asp Glu Glu Glu 915 920 925 att ttg aag gac gag gag cat aaa ctg gag gaa gag gag ttt agt tct 2832 Ile Leu Lys Asp Glu Glu His Lys Leu Glu Glu Glu Glu Phe Ser Ser 930 935 940 tat gat tca aaa agc ata act caa tac tga 2862 Tyr Asp Ser Lys Ser Ile Thr Gln Tyr 945 950 36 953 PRT Sorghum sp. 36 Met Glu Ala Ile Val Cys Ala Ser His Gly Ala Met Gly Ser Leu Leu 1 5 10 15 Trp Lys Leu Gly Ala Leu Leu Ser Asp Glu Tyr Lys Leu Leu Ser Ser 20 25 30 Val Lys Val Asp Met Met Phe Leu Lys Ala Glu Leu Glu Val Met His 35 40 45 Ala Phe Leu Lys Lys Met Ser Glu Val Glu Val Pro Asp Glu Gln Ser 50 55 60 Lys Cys Trp Met Lys Glu Val Arg Glu Leu Ser Tyr Asp Ile Glu Asp 65 70 75 80 Ser Ile Asp Ser Phe Ile Phe Ser Leu Gly Cys Glu Ser Asn Ser Glu 85 90 95 Pro Arg Gly Phe Lys Gly Phe Val Gly Arg Cys Leu Asn Leu Phe Ala 100 105 110 Asp Ala Lys Thr Arg His Trp Ile Ala Lys Lys Ile Gln Arg Leu Lys 115 120 125 Cys His Val Ile Glu Ala Ser Asn Arg Arg Gly Arg Tyr Arg Val Asp 130 135 140 Asp Ala Val Pro Arg Leu Ser Arg Thr Ser Ile Asp Pro Arg Leu Pro 145 150 155 160 Ala Leu Tyr Thr Glu Thr Thr Arg Leu Val Gly Val Asp Gly Pro Arg 165 170 175 Asp Lys Leu Ile Lys Leu Leu Thr Glu Arg Glu Gly Thr Thr Thr Gln 180 185 190 Leu Ser Val Val Ser Ile Val Gly Phe Gly Gly Leu Gly Lys Thr Thr 195 200 205 Leu Ala Asn Glu Val Tyr Gln Lys Leu Glu Gly Gln Phe Asp Tyr Arg 210 215 220 Ala Phe Val Ser Val Ser Gln Lys Pro Asp Ile Lys Lys Ile Leu Arg 225 230 235 240 His Ile Leu Cys Gln Tyr Ser Cys Arg Glu Cys Gly Asn Asn Glu Ile 245 250 255 Trp Asp Glu Gln Gln Leu Ile Asn Thr Thr Arg Gln Phe Leu Lys Asp 260 265 270 Lys Arg Tyr Phe Ile Val Ile Asp Asp Ile Trp Ser Ile Ser Ala Trp 275 280 285 Arg Thr Ile Arg Cys Ala Phe Pro Glu Asn Asn Cys Ser Ser Arg Ile 290 295 300 Leu Thr Thr Thr Arg Ile Ile Thr Val Ala Lys Tyr Cys Cys Ser Pro 305 310 315 320 His Arg Asp His Val Tyr Glu Leu Lys Pro Leu Asp Ala Ala His Ser 325 330 335 Lys Ser Leu Phe Phe Asn Arg Ile Phe Gly Ser Glu Asp Arg Cys Pro 340 345 350 Leu His Leu Lys Glu Val Ser Asn Gly Ile Leu Lys Lys Cys Gly Gly 355 360 365 Leu Pro Leu Ala Ile Ile Thr Val Ala Ser Leu Leu Val Thr Lys Ala 370 375 380 Ile Thr Lys Glu Glu Trp Glu Lys Met Leu Lys Ser Ile Gly Ser Ala 385 390 395 400 Leu Glu Lys Asp Thr Asp Met Glu Glu Met Lys Lys Ile Leu Leu Leu 405 410 415 Ser Tyr Asn Asp Leu Pro Tyr His Leu Lys Thr Cys Leu Leu Tyr Leu 420 425 430 Gly Val Phe Pro Glu Asp Tyr Glu Ile Lys Arg Asp Arg Leu Ile Arg 435 440 445 Arg Trp Ile Ala Glu Gly Phe Ile Thr Thr Glu Gly Gly Gln Asp Met 450 455 460 Glu Glu Ile Gly Glu Cys Tyr Phe Asn Glu Leu Ile Asn Arg Ser Met 465 470 475 480 Ile Gln Pro Val Gly Ile Gln Tyr Asp Gly Arg Ala Asp Ala Cys Arg 485 490 495 Val His Asp Met Ile Leu Asp Leu Ile Ile Ser Lys Ser Val Glu Glu 500 505 510 Asn Phe Leu Thr Leu Cys Gly Asp Gly Asn His Lys Leu Leu Gln Gln 515 520 525 Asp Lys Val Arg Arg Leu Ser Ile Asn Tyr His Ala Arg Asp Asp Ile 530 535 540 Ile Val Pro Thr Asn Met Ile Val Ser Asn Val Arg Ser Leu Thr Ile 545 550 555 560 Phe Gly Tyr Asp Glu Asn Met Pro Gly Leu Ser Asn Phe Leu Leu Leu 565 570 575 Arg Val Leu Asp Leu Glu Asn Arg Val Val Leu Glu Tyr Asn Tyr Leu 580 585 590 Arg His Ile Gly Arg Leu Ser Gln Leu Arg Tyr Leu Arg Leu Ser Ser 595 600 605 Arg Arg Ile Thr Ala Leu Pro Glu Gln Ile Gly Asp Leu Gln Asn Leu 610 615 620 Gln Thr Leu Asp Leu Arg Trp Thr Arg Ile Lys Arg Leu Pro Gln Ser 625 630 635 640 Val Val Leu Leu Arg Arg Leu Thr Cys Leu Leu Val Asn Ser Leu Glu 645 650 655 Leu Pro Glu Gly Ile Gly Asn Met Gln Ala Leu Gln Glu Leu Ser Glu 660 665 670 Ile Glu Ile Asn Cys His Thr Ser Val Ser Ser Leu Leu Glu Leu Gly 675 680 685 Lys Leu Thr Asn Ile Arg Ile Leu Gly Leu Asn Trp Cys Ile Leu Asp 690 695 700 Thr Asn Tyr Val Thr Lys Ile His Ala Asp Ser Leu Val Met Ser Leu 705 710 715 720 Cys Lys Leu Gly Met Leu Asn Leu Arg Ser Ile Gln Ile Gln Ser Tyr 725 730 735 His Ser Cys Ser Leu Asp Phe Leu Lys Asp Ser Trp Phe Pro Pro Pro 740 745 750 Arg Arg Leu Gln Ile Phe Asp Met Ser Ile Asp Tyr His Phe Pro Arg 755 760 765 Ile Pro Asn Trp Ile Ile Ser Leu Glu Tyr Leu Ser Tyr Leu Asp Ile 770 775 780 Tyr Leu Thr Pro Val Asp Glu Glu Ser Phe Arg Thr Leu Gly Asp Leu 785 790 795 800 Pro Ser Leu Leu Phe Leu Trp Ile Ser Ser Arg Glu Ala Lys Pro Lys 805 810 815 Glu Gly Val Ile Val Ser Ser Asn Gly Phe Arg Cys Leu Lys Glu Phe 820 825 830 Tyr Phe Thr Cys Trp Glu Ile Gly Thr Gly Leu Ser Phe Glu Pro Gly 835 840 845 Ala Met Pro Met Leu Glu Lys Leu Arg Ile Pro Phe Asn Ala His Gly 850 855 860 Val Cys Ser Leu His Gly Val Leu Asp Ser Gly Ile Trp His Leu Cys 865 870 875 880 Ser Leu Arg His Leu His Val Glu Ile Ile Cys His Gly Ala Arg Leu 885 890 895 Lys Glu Val Glu Ala Val Glu Glu Ala Val Lys Asn Ala Ala Ser Tyr 900 905 910 Leu Ser Asp Glu Leu Phe Leu Asp Val Arg Arg Trp Asp Glu Glu Glu 915 920 925 Ile Leu Lys Asp Glu Glu His Lys Leu Glu Glu Glu Glu Phe Ser Ser 930 935 940 Tyr Asp Ser Lys Ser Ile Thr Gln Tyr 945 950 37 93 PRT Zea mays 37 Phe Leu Leu Val Leu Asp Asp Val Trp Asn Glu Asp Pro Glu Lys Trp 1 5 10 15 Asp Arg Tyr Arg Cys Ala Leu Leu Ser Gly Gly Lys Gly Ser Arg Ile 20 25 30 Ile Ile Thr Thr Arg Asn Lys Asn Val Gly Ile Leu Met Gly Gly Met 35 40 45 Thr Pro Tyr His Leu Lys Gln Leu Ser Asn Asp Asp Cys Trp Gln Leu 50 55 60 Phe Lys Lys His Ala Phe Val Asp Gly Asp Ser Ser Ser His Pro Glu 65 70 75 80 Leu Glu Ile Ile Gly Lys Asp Ile Val Lys Lys Leu Lys 85 90 38 93 PRT Zea mays 38 Tyr Phe Val Val Ile Asp Asp Ile Trp Ser Thr Ser Ala Trp Arg Ile 1 5 10 15 Ile Arg Cys Ala Phe Pro Glu Asn Asn Cys Ser Ser Arg Ile Leu Thr 20 25 30 Thr Thr Arg Ile Val Thr Val Ala Lys Tyr Cys Cys Ser Pro His Leu 35 40 45 Asp Arg Val Tyr Ala Leu Glu Pro Leu Asp Ala Ala His Ser Glu Ser 50 55 60 Leu Phe Leu Ser Arg Ile Phe Gly Ser Glu Asp Arg Cys Pro Leu His 65 70 75 80 Leu Lys Glu Ala Ser Asp Lys Ile Leu Lys Arg Cys Gly 85 90 39 88 PRT Zea mays VARIANT (1)...(88) Xaa = Any Amino Acid 39 Phe Leu Leu Val Ile Asp Gly Asp Val Ser Asn Ser Asp Cys Xaa Ala 1 5 10 15 Ile Leu Ala Ala Leu Pro Glu Lys Asn Asn Gly Ser Arg Ile Val Arg 20 25 30 Ile Met Gln Gly Ile Arg Lys Xaa Pro Arg Gly Ile Ala Ala Lys His 35 40 45 Trp Ile Glu Leu Lys Cys Phe Glu Thr Glu Lys Thr Thr Ser Xaa Phe 50 55 60 Ser Gln Arg Val Cys Met Glu Glu Lys Ile Asn Ile Glu Asn Phe Asp 65 70 75 80 Glu Val Leu His Gly Ile Thr Lys 85 40 92 PRT Oryza sp. 40 Phe Phe Leu Val Leu Asp Asn Val Trp Asn Val Gln Lys Glu Ile Trp 1 5 10 15 Asp Ala Leu Leu Ser Leu Leu Val Gly Ala Gln Leu Gly Met Ile Leu 20 25 30 Leu Thr Thr Arg Asp Glu Thr Ile Ser Lys Met Ile Gly Thr Met Pro 35 40 45 Ser Tyr Asp Leu Ser Phe Leu Thr Ser Glu Glu Ser Trp Gln Leu Phe 50 55 60 Lys Gln Met Ala Phe Gly Phe Ile Asp Gln His Met Asp Gln Gln Phe 65 70 75 80 Glu Gly Phe Gly Arg Lys Ile Val Gly Lys Cys Gly 85 90 41 90 PRT Oryza sp. VARIANT (1)...(90) Xaa = Any Amino Acid 41 Phe Phe Leu Val Leu Asp Asp Val Trp His Tyr Lys Ala Trp Glu Asp 1 5 10 15 Leu Leu Arg Thr Pro Leu Asn Ala Ala Ala Thr Gly Ile Ile Leu Val 20 25 30 Thr Thr Arg Asp Glu Thr Ile Ala Arg Val Ile Gly Val Asp Arg Thr 35 40 45 His Arg Val Asp Leu Met Ser Ala Asp Val Gly Trp Glu Leu Leu Trp 50 55 60 Arg Ser Met Asn Ile Lys Xaa Glu Lys Gln Val Lys Asn Leu Arg Asp 65 70 75 80 Thr Gly Ile Glu Ile Val Arg Lys Cys Gly 85 90 42 93 PRT Oryza sp. 42 Phe Leu Leu Val Leu Asp Asp Val Trp Asn Glu Asp Pro Asp Arg Trp 1 5 10 15 Asp Arg Tyr Arg Arg Ala Leu Val Ala Gly Ala Lys Gly Ser Lys Ile 20 25 30 Met Val Thr Thr Arg Asn Glu Asn Val Gly Lys Leu Met Gly Gly Leu 35 40 45 Thr Pro Tyr Tyr Leu Lys Gln Leu Ser Tyr Asn Asp Ser Trp His Leu 50 55 60 Phe Ile Ser Tyr Ala Phe Val Asp Gly Asp Ser Ser Ala His Pro Ser 65 70 75 80 Leu Glu Met Ile Gly Lys Glu Ile Val His Lys Leu Lys 85 90 43 92 PRT Oryza sp. 43 Leu Phe Phe Val Leu Asp Asp Val Trp Ser Glu Cys Glu Ser Ser Trp 1 5 10 15 His Asp Leu Ile Arg Pro Leu Ser Tyr Ala Leu Thr Val Ile Ile Leu 20 25 30 Val Thr Thr Arg Ser Lys Glu Val Ala Arg Leu Ala Gly Thr Val Lys 35 40 45 Pro Phe Tyr Leu Thr Ala Ile Pro Asn Asp Asp Cys Trp Leu Leu Phe 50 55 60 Gln His Phe Ala Phe Gly Lys Gln Cys Val Asn Glu Lys Ser Ser Leu 65 70 75 80 Val Gln Ile Gly Lys Lys Ile Leu Gln Lys Cys Gly 85 90 44 90 PRT Sorghum sp. 44 Phe Phe Leu Val Leu Asp Asp Ile Trp Gln His Glu Val Trp Thr Asn 1 5 10 15 Leu Leu Arg Ala Pro Leu Asn Thr Ala Ala Thr Gly Thr Ile Leu Val 20 25 30 Thr Thr Arg Asn Asp Thr Val Ala Arg Ala Ile Gly Val Glu Asp Ile 35 40 45 His Arg Val Glu Leu Met Ser Asp Glu Val Gly Trp Lys Leu Leu Leu 50 55 60 Lys Ser Met Asn Ile Ser Lys Glu Ser Glu Val Glu Asn Leu Arg Val 65 70 75 80 Leu Gly Val Asp Ile Val Arg Leu Cys Gly 85 90 45 90 PRT Sorghum sp. VARIANT (1)...(90) Xaa = Any Amino Acid 45 Phe Leu Leu Val Leu Asp Asp Leu Trp Glu Ser Asp Val Trp Thr Asn 1 5 10 15 Leu Leu Arg Thr Pro Leu Ala Ala Ala Asp Gln Val Thr Ile Leu Val 20 25 30 Thr Thr Arg His Asp Thr Val Ala Lys Ala Ile Gly Val Gly His Met 35 40 45 His Arg Val Glu Leu Leu Ser Glu Glu Val Gly Trp Glu Leu Leu Trp 50 55 60 Lys Ser Met Xaa Ile Ser Ser Glu Lys Glu Val Leu Asn Leu Arg Glu 65 70 75 80 Thr Gly Ile Gly Ile Val Gln Lys Cys Gly 85 90 46 95 PRT Sorghum sp. 46 Phe Leu Leu Val Leu Asp Asp Met Trp Thr Asp His Glu Asp Glu Trp 1 5 10 15 Lys Lys Leu Leu Ala Pro Phe Lys Lys Met Gln Thr Lys Gly Asn Met 20 25 30 Ala Ile Val Thr Thr Arg Ile Pro Lys Val Ala Arg Met Val Ala Thr 35 40 45 Val Gly Cys Gln Ile Arg Leu Glu Arg Leu Ser Asp Glu Glu Cys Met 50 55 60 Cys Phe Phe Gln Glu Cys Val Phe Gly Asn Arg Gln Thr Trp Glu Gly 65 70 75 80 His Ala Ile Leu His Asp Phe Gly Tyr Lys Ile Val Lys Arg Leu 85 90 95 47 93 PRT Sorghum sp. VARIANT (1)...(93) Xaa = Any Amino Acid 47 Tyr Phe Ile Val Ile Asp Asp Ile Trp Ser Ile Ser Ala Trp Arg Thr 1 5 10 15 Ile Arg Cys Ala Phe Pro Glu Asn Asn Cys Ser Ser Arg Ile Leu Thr 20 25 30 Thr Thr Arg Ile Ile Thr Val Ala Lys Tyr Cys Cys Ser Pro Xaa Arg 35 40 45 Asp His Val Tyr Glu Leu Lys Pro Leu Asp Ala Ala His Ser Lys Ser 50 55 60 Leu Phe Phe Asn Arg Ile Phe Gly Ser Glu Asp Arg Cys Pro Leu His 65 70 75 80 Leu Lys Glu Val Ser Asn Gly Ile Leu Lys Lys Cys Gly 85 90 48 85 PRT Sorghum sp. VARIANT (1)...(85) Xaa = Any Amino Acid 48 Tyr Leu Ile Val Ile Asp Asp Ile Gln Met Asp Glu Trp Arg Thr Val 1 5 10 15 Lys Ser Val Phe Glu His Ser Ser Thr Ser Ser Arg Ile Ile Leu Thr 20 25 30 Thr Thr Ile Gln Pro Met Ala Asn Met Cys Ser Ser His Gly Asn Gly 35 40 45 Tyr Val Tyr Gln Met Asp Thr Leu Gly Glu Glu Asp Ser Lys Lys Ile 50 55 60 Ala Xaa Pro Gly Ile Arg Ser Pro Glu Leu Glu His Gly Ser Ala Ala 65 70 75 80 Leu Leu Arg Lys Cys 85 49 92 PRT Sorghum sp. 49 Tyr Leu Ile Ile Ile Asp Asp Ile Trp Ser Ile Leu Ala Trp Asp Ala 1 5 10 15 Ile Lys Tyr Ala Phe Pro Glu Asn Asn Phe Ser Ser Arg Ile Ile Ala 20 25 30 Thr Thr Arg Ile Val Asp Val Ala Arg Ser Cys Cys Leu Gly Gly Asn 35 40 45 Asp Arg Met Tyr Glu Met Glu Ala Leu Ser Gly Leu His Ser Lys Lys 50 55 60 Leu Phe Phe Lys Arg Thr Phe Gly Ser Glu Asp Cys Cys Pro Asp Val 65 70 75 80 Leu Lys Glu Val Ser Asn Glu Ile Leu Lys Lys Cys 85 90 50 91 PRT Linum unitatissimum 50 Ile Leu Val Val Leu Asp Asp Val Asp Glu Lys Phe Lys Phe Glu Asp 1 5 10 15 Met Leu Gly Ser Pro Lys Asp Phe Ile Ser Gln Ser Arg Phe Ile Ile 20 25 30 Thr Ser Arg Ser Met Arg Val Leu Gly Thr Leu Asn Glu Asn Gln Cys 35 40 45 Lys Leu Tyr Glu Val Gly Ser Met Ser Lys Pro Arg Ser Leu Glu Leu 50 55 60 Phe Ser Lys His Ala Phe Lys Lys Asn Thr Pro Pro Ser Tyr Tyr Glu 65 70 75 80 Thr Leu Ala Asn Asp Val Val Asp Thr Thr Ala 85 90 51 88 PRT Nicotiana glutinosa 51 Val Leu Ile Val Leu Asp Asp Ile Asp Asn Lys Asp His Tyr Leu Glu 1 5 10 15 Tyr Leu Ala Gly Asp Leu Asp Trp Phe Gly Asn Gly Ser Arg Ile Ile 20 25 30 Ile Thr Thr Arg Asp Lys His Leu Ile Glu Lys Asn Asp Ile Ile Tyr 35 40 45 Glu Val Thr Ala Leu Pro Asp His Glu Ser Ile Gln Leu Phe Lys Gln 50 55 60 His Ala Phe Gly Lys Glu Val Pro Asn Glu Asn Phe Glu Lys Leu Ser 65 70 75 80 Leu Glu Val Val Asn Tyr Ala Lys 85 52 90 PRT Lycopersicon esculentum 52 Phe Leu Ile Leu Ile Asp Asp Val Trp Asp Tyr Lys Val Trp Asp Asn 1 5 10 15 Leu Cys Met Cys Phe Ser Asp Val Ser Asn Arg Ser Arg Ile Ile Leu 20 25 30 Thr Thr Arg Leu Asn Asp Val Ala Glu Tyr Val Lys Cys Glu Ser Asp 35 40 45 Pro His His Leu Arg Leu Phe Arg Asp Asp Glu Ser Trp Thr Leu Leu 50 55 60 Gln Lys Glu Val Phe Gln Gly Glu Ser Cys Pro Pro Glu Leu Glu Asp 65 70 75 80 Val Gly Phe Glu Ile Ser Lys Ser Cys Arg 85 90 53 90 PRT Arabidopsis thaliana 53 Phe Leu Leu Leu Leu Asp Asp Val Trp Glu Glu Ile Asp Leu Glu Lys 1 5 10 15 Thr Gly Val Pro Arg Pro Asp Arg Glu Asn Lys Cys Lys Val Met Phe 20 25 30 Thr Thr Arg Ser Ile Ala Leu Cys Asn Asn Met Gly Ala Glu Tyr Lys 35 40 45 Leu Arg Val Glu Phe Leu Glu Lys Lys His Ala Trp Glu Leu Phe Cys 50 55 60 Ser Lys Val Trp Arg Lys Asp Leu Leu Glu Ser Ser Ser Ile Arg Arg 65 70 75 80 Leu Ala Glu Ile Ile Val Ser Lys Cys Gly 85 90 54 94 PRT Arabidopsis thaliana 54 Tyr Ile Val Val Leu Asp Asp Val Trp Thr Thr Gly Leu Trp Arg Glu 1 5 10 15 Ile Ser Ile Ala Leu Pro Asp Gly Ile Tyr Gly Ser Arg Val Met Met 20 25 30 Thr Thr Arg Asp Met Asn Val Ala Ser Phe Pro Tyr Gly Ile Gly Ser 35 40 45 Thr Lys His Glu Ile Glu Leu Leu Lys Glu Asp Glu Ala Trp Val Leu 50 55 60 Phe Ser Asn Lys Ala Phe Pro Ala Ser Leu Glu Gln Cys Arg Thr Gln 65 70 75 80 Asn Leu Glu Pro Ile Ala Arg Lys Leu Val Glu Arg Cys Gln 85 90 

That which is claimed is:
 1. An isolated nucleic acid molecule selected from the group consisting of: a) a nucleic acid molecule comprising a nucleotide sequence that encodes a polypeptide that confers disease resistance to a plant, said sequence having at least 85% sequence identity to the sequence set forth in SEQ ID NO: 34, 35, or a complement thereof; b) a nucleic acid molecule comprising a fragment of at least 300 contiguous nucleotides of the sequence set forth in SEQ ID NO: 34, 35, or a complement thereof, wherein said fragment encodes a polypeptide that confers disease resistance to a plant; c) a nucleic acid molecule that encodes a polypeptide comprising the amino acid sequence set forth in SEQ ID NO: 36; d) a nucleic acid molecule that encodes a fragment of a polypeptide comprising the amino acid sequence of SEQ ID NO: 36, wherein the fragment retains the ability to confer disease resistance to a plant and comprises at least 100 contiguous amino acids of SEQ ID NO: 36; e) a nucleic acid molecule that encodes a polypeptide that confers disease resistance to a plant, wherein the nucleic acid molecule hybridizes to a sequence of a), b), c), d), or a complement thereof under stringent conditions, said stringent conditions comprising hybridization in 40% formamide, 1.0 M NaCl, 1% SDS at 37° C., followed by washing with 1×SSC at 55° C.; and f) a nucleic acid molecule comprising a fragment of at least 50 contiguous nucleotides of SEQ ID NO: 34, 35, or a complement thereof, wherein said fragment hybridizes to a sequence of a), b), c), d), or a complement thereof under stringent conditions, said stringent conditions comprising hybridization in 40% formamide, 1.0 M NaCl, 1% SDS at 37° C., followed by washing with 1×SSC at 55° C.
 2. A DNA construct comprising a nucleotide sequence of claim 1 operably linked to a promoter that drives expression in a plant cell.
 3. A vector comprising the DNA construct of claim
 2. 4. A plant cell having stably incorporated in its genome the DNA construct of claim
 2. 5. A plant having stably incorporated in its genome the DNA construct of claim
 2. 6. A method for creating or enhancing disease resistance in a plant, said method comprising transforming said plant with a DNA construct comprising a nucleic acid molecule operably linked to a promoter that drives expression of a coding sequence in a plant cell and regenerating stably transformed plants, wherein said nucleic acid molecule is selected from the group consisting of: a) a nucleic acid molecule comprising a nucleotide sequence that encodes a polypeptide that confers disease resistance to a plant, said sequence having at least 85% sequence identity to the sequence set forth in SEQ ID NO: 34, 35, or a complement thereof; b) a nucleic acid molecule comprising a fragment of at least 300 contiguous nucleotides of the sequence set forth in SEQ ID NO: 34, 35, or a complement thereof, wherein said fragment encodes a polypeptide that confers disease resistance to a plant; c) a nucleic acid molecule that encodes a polypeptide comprising the amino acid sequence set forth in SEQ ID NO: 36; d) a nucleic acid molecule that encodes a fragment of a polypeptide comprising the amino acid sequence of SEQ ID NO: 36, wherein the fragment retains the ability to confer disease resistance to a plant and comprises at least 100 contiguous amino acids of SEQ ID NO: 36; and e) a nucleic acid molecule that encodes a polypeptide that confers disease resistance to a plant, wherein the nucleic acid molecule hybridizes to a sequence of a), b), c), d), or a complement thereof under stringent conditions, said stringent conditions comprising hybridization in 40% formamide, 1.0 M NaCl, 1% SDS at 37° C., followed by washing with 1×SSC at 55° C.
 7. The method of claim 6, wherein said plant is a dicot.
 8. The method of claim 6, wherein said plant is a monocot.
 9. The method of claim 8, wherein said monocot is selected from the group consisting of maize, sorghum, barley, rice, and wheat.
 10. The method of claim 6, wherein said promoter is a constitutive promoter.
 11. The method of claim 6, wherein said promoter is an inducible promoter.
 12. A plant stably transformed with a DNA construct comprising a nucleic acid molecule operably linked to a promoter that drives expression of a coding sequence in a plant cell, wherein said nucleic acid molecule is selected from the group consisting of: a) a nucleic acid molecule comprising a nucleotide sequence that encodes a polypeptide disease resistance activity, said sequence having at least 85% sequence identity to the sequence set forth in SEQ ID NO: 34, 35, or a complement thereof; b) a nucleic acid molecule comprising a fragment of at least 300 contiguous nucleotides of the sequence set forth in SEQ ID NO: 34, 35, or a complement thereof, wherein said fragment encodes a polypeptide that confers disease resistance to a plant; c) a nucleic acid molecule that encodes a polypeptide comprising the amino acid sequence set forth in SEQ ID NO: 36; d) a nucleic acid molecule that encodes a fragment of a polypeptide comprising the amino acid sequence of SEQ ID NO: 36, wherein the fragment retains the ability to confer disease resistance to a plant and comprises at least 100 contiguous amino acids of SEQ ID NO: 36; and e) a nucleic acid molecule which encodes a polypeptide that confers disease resistance to a plant, wherein the nucleic acid molecule hybridizes to a sequence of a), b), c), d), or a complement thereof under stringent conditions, said stringent conditions comprising hybridization in 40% formamide, 1.0 M NaCl, 1% SDS at 37° C., followed by washing with 1×SSC at 55° C.
 13. The plant of claim 12, wherein said plant is a dicot.
 14. The plant of claim 12, wherein said plant is a monocot.
 15. The plant of claim 14, wherein said monocot is selected from the group consisting of maize, sorghum, barley, rice, and wheat.
 16. The plant of claim 12, wherein said promoter is a constitutive promoter.
 17. The plant of claim 12, wherein said promoter is an inducible promoter.
 18. Seed of the plant of claim
 12. 19. Seed of the plant of claim
 13. 20. Seed of the plant of claim
 14. 21. Seed of the plant of claim
 15. 